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Abstract 



This thesis consoUdates, improves and extends the smooth en- 
tropy framework for non-asymptotic information theory and 
cryptography. 

We investigate the conditional min- and max-entropy for quan- 
tum states, generahzations of classical Renyi entropies. We in- 
troduce the purified distance, a novel metric for unnormalized 
quantum states and use it to define smooth entropies as op- 
timizations of the min- and max-entropies over a ball of close 
states. We explore various properties of these entropies, includ- 
ing data-processing inequalities, chain rules and their classical 
limits. The most important property is an entropic formulation 
of the asymptotic equipartition property, which implies that the 
smooth entropies converge to the von Neumann entropy in the 
limit of many independent copies. The smooth entropies also 
satisfy duality and entropic uncertainty relations that provide 
limits on the power of two different observers to predict the 
outcome of a measurement on a quantum system. 

Finally, we discuss three example applications of the smooth 
entropy framework. We show a strong converse statement 
for source coding with quantum side information, character- 
ize randomness extraction against quantum side information 
and prove information theoretic security of quantum key dis- 
tribution using an intuitive argument based on the entropic 
uncertainty relation. 
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Abbreviations and Notation 



Abbreviation 


Description 


CPM 


Completely positive map 


TP-CPM 


Trace-preserving completely positive map 


SDP 


Semi-definite program 


POVM 


Positive operator-valued measurement 


QKD 


Quantum key distribution 


i.i.d. 


Independent and identically distributed 


UCR 


Uncertainty relation 


CQ 


Classical-quantum (e.g. CQ-states) 


Ihs. 


Left-hand side (of an equation) 


rhs. 


Right-hand side (of an equation) 



Table 1: List of Abbreviations. 



Symbol 

A,B',AD 
X,Y,Z 



tr„. 



d-A 

S={'K) 

S<{-K) 

p,f,uj\,aBc 

Itp), Vabc 
t^a.Iaa' 

Dip,T) 
F{p,T) 
PiP,r) 



Description 

Typical physical systems and joint systems 
Typical registers (random variables) 
Hilbert spaces corresponding to a joint quantum and 
to a classical system 

Dimension of the system A, cIa = dim{^^} 
Partial trace over subsystem A 
Normalized quantum states on 'K 
Sub-normalized states on !K 

Typical (possibly sub-normalized) quantum states 
Typical pure states, i.e. rank 1 quantum states 
Completely mixed state on A and maximally entan- 
gled state between A and A' 
Generalized trace distance between p and r 
Generalized fidelity between p and r 
Purified distance between p and r 



Table 2: Notational Conventions for Quantum Mechanics. 
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Symbol 


Description 


log = log2 


Binary logarithm, i.e. logarithm to the basis 2 


In 


Natural logarithm 


M,C 


Real and complex numbers 


Ji,Ji' 


Typical Hilbert spaces 


\v),w),m 


Typical elements of a Hilbert space, kets 


{i^\A4>\ 


Typical functionals on a Hilbert space, bras 


tr,tr5, 


Trace and partial trace over 'K 


£(:k),£(5{,:k') 


Linear operators on "K and from "K to 'K' 


n{:K),v{j{) 


Hermitian and positive semi-definite operators on "K 


A> B 


Equivalent to A- B €V{Ji) 


xt 


The adjoint operator of X 


^T 


The transpose operator of X, defined with regards 




to a basis that needs to be specified 


x-1 


The generalized inverse operator of X 


ll^lloo 


Maximum singular value of X, operator norm 


1 


Identity operator 


E,X,L 


Typical linear operators on a Hilbert space 


M,N 


Typical positive semi-definte operators 


u,v,w 


Typical unitary operators or isometries 


P,U 


Typical projectors 


£,J^,I 


Typical trace-preserving completely positive maps 



Table 3: Notational Conventions for Mathematical Expressions. 



Chapter 1 

Introduction 



This chapter starts with a rather philosophical introduction into quantum 
mechanics that does not assume any prior knowledge. Here, we attempt to 
explain and justify the information theoretic approach this thesis will take 
on the topic. This will lead into a short overview over some relevant aspects 
of information and quantum information theory. Then, we discuss the 
importance of non-asymptotic quantum information theory to characterize 
elementary information processing tasks. Finally, the introduction ends 
with a detailed outline of the thesis. 

1.1 Quantum Mechanics 

The laws of quantum mechanics govern the behavior of microscopic physi- 
cal systems and are verified daily in experiments conducted in physics labo- 
ratories worldwide. Here, we restrict our attention to non-relativistic quan- 
tum mechanics and take a static perspective on quantum theory, where the 
state of a physical system is the central object of interest. 

Non-relativistic quantum mechanics is the theoretical basis of today's 
semiconductor industry. As we further miniaturize physical devices used 
for information processing, the impact of quantum mechanics will become 
more and more relevant. Hence, a thorough understanding of quantum 
physics will be pivotal to successfully engineer the next generation of in- 
formation processing devices. In quantum cryptography [BB84, Eke91], 
some of the more peculiar effects of quantum mechanics are already ex- 
ploited today in order to ensure secrecy of the communication between two 
distant parties. 

While an understanding of the laws of quantum mechanics is thus nec- 
essary in order to fully comprehend the physical world surrounding us, 
these laws are nevertheless in stark contrast with our intuition about the 
causal structure of the universe. In our everyday experience, any observa- 
tion about a physical system can be predicted perfectly given a complete 
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objective description of the state of said system. For example, position, 
momentum and spin of a football are part of its objective state and every 
observer can verify their value independently, given appropriate measure- 
ment equipment. This perspective is deeply ingrained in our language: We 
talk about the position, momentum and spin of a football, directly asso- 
ciating our observations (the outcomes of position, momentum and spin 
measurements) with the football and, thus, implying that there exists an 
objective reality — the state of the football — beyond our observations. 

However, quantum mechanics does not allow for an objective descrip- 
tion of the state of a physical system that deterministically predicts all ob- 
servations about said system. For example, it is impossible to write down 
a quantum mechanical state of an electron such that the outcomes of both 
position and momentum measurements can be predicted with arbitrary 
precision. This is known as the Heisenberg uncertainty principle [Hci27] 
and has mystified physicists since the early days of quantum mechanics. 
In what sense can position and momentum then be considered real? Is it 
even permissible to speak of the position and momentum of a quantum 
mechanical object like an electron? 

Furthermore — as if the loss of determinism was not enough to confuse 
our human minds — quantum mechanics generally does not even allow for 
an objective description of the state of a physical system that determines 
the probabilities with which different observations about it are made. These 
probabilities, even if the description of the state is complete within quan- 
tum mechanics, are in general subjective to the observer. It is crucial 
to note here that classical theory allows to describe subjective knowledge 
about the state of a system, for example through conditional probability 
distributions. (We will encounter an example model of subjective classi- 
cal information in the next section.) However, such a description cannot 
be considered complete within classical theory, since, in principle, every 
observer may hold a copy of the full objective state of the system, e.g. 
position, momentum and spin of the football. 

To convince yourself that quantum mechanics is incompatible with this 
notion of objectivity, consider two particles that are in an entangled state. 
More specifically, this could be two electrons, A and B, that are in a 
spin singlet state. In this setting, quantum mechanics postulates that an 
observer controlling electron B can perfectly predict the outcomes of all 
possible spin measurements on electron A} We call this observer an omni- 
scient observer of the electron A. At the same time, all other observers are 
ignorant about A, which means that they will see all possible measurement 
outcomes with equal probability. Quantum mechanics, in contrast to clas- 
sical theory, does not allow to copy the quantum information encoded in 



^The observer simply measures the spin of B in the same direction as the spin of A 
is measured to get a fully anti-correlated result. 
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the state of electron B and share it with other observers. On an intuitive 
level, this is often explained using the concepts of no-cloning [WZ82] or the 
monogamy of entanglement. These statements can be formulated quanti- 
tatively and we will see in Chapter 4 that the fact that one observer is close 
to omniscient about a quantum system implies that all other observers are 
necessarily close to ignorant about it. 

This apparent deficiency of quantum mechanics famously led Einstein, 
Podolsky and Rosen to ask "Can quantum- mechanical description of phys- 
ical reality be considered complete?" [EPR35]. The search for a theory, 
consistent with quantum mechanics, that would assign objective descrip- 
tions^ to physical systems that probabilistically predict observations about 
said systems, was abandoned when Bell [Bcl64] as well as Kochen and 
Specker [KS68] proved their pivotal theorems. They show that such ob- 
jective descriptions cannot fully predict the probabilities of certain obser- 
vations that are consistent with quantum mechanics and can be verified 
experimentally. ^ 

Furthermore, the existence of preferred observers renders quantum cryp- 
tography possible. In quantum key distribution, we consider two players, 
traditionally called Alice and Bob, who want to share a secret over a public 
channel. Once the two players can establish that Bob is almost omniscient 
about a quantum system Alice controls, a shared secret can be produced 
by an arbitrary measurement of Alice's system: while Bob can predict Al- 
ice's observed measurement result well, by the laws of quantum mechanics, 
any eavesdropper is guaranteed to have almost no knowledge about it. 



1.2 Information Theory 

Luckily, some of the counter-intuitive effects of quantum mechanics — in 
particular, the appearance of non-determinism and subjective knowledge 
as explained above — have also been considered in a completely different 
context, in probability and information theory. The latter was founded by 
Nyquist [Nyq28] and Shannon [Sha48] in the early days of telegraph com- 
munication in order to investigate information sources and the capacities 
of channels to transmit digital information. 

Following Shannon, consider a source that outputs English text. Clearly, 
the different letters in the Latin alphabet will not be produced with equal 
probability by such a source. This is used, for example, in the Morse 



^Such objective descriptions of a system are usually called (local) hidden variables 
in this context, since they are not accessible from within quantum mechanics. 

^In fact, it was recently shown that if such objective descriptions have any predictive 
power exceeding quantum mechanics, they can be falsified experimentally under minimal 
assumptions. [CR08, CRIO] 
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alphabet, where the most frequent letters are given the simplest codes. ^ 
Going one step further, note that certain combinations of letters are more 
likely than others. For example, the letter 'q' is almost always followed by 
a 'u'. Hence, given that the last letter this source produced was a 'q', the 
information content, or uncertainty, of the next letter is very low. 

This implies that the probability with which letters occur is relevant in 
order to characterize the information content of the source — a source that 
produces every letter with equal probability produces more information 
than a source that is biased towards certain letters. To quantify these con- 
siderations. Shannon used entropies. Very generally, he considered events 
that are known to occur with a certain probability p and assigned to them 
the value —logp, the surprisal, which measures how surprising the occur- 
rence of the event is. (The logarithm is taken to the binary basis through- 
out this thesis.) If an event occurs with certainty its surprisal is zero and 
as the event gets less likely its surprisal can grow arbitrarily large. In the 
above example, the surprisal of the letters 'e' and 't' is smaller than the 
surprisal of less common letters. Moreover, the surprisal of the letter 'u' 
given that the last letter was 'q' is close to zero. 

The Shannon entropy of a source is the average surprisal of the events it 
produces and is introduced more formally in the following. For the purpose 
of this thesis, a probability distribution on a random, variable X that takes 
values from a discrete set Af is a function Px from X to non-negative real 
numbers with the property that '^^ex ^x{x) = 1. The Shannon entropy 
of a random variable X with distribution Px is then defined as 



HiX)p:=-Y,Pxix)logPx{x). (1.1) 



xGX 

Here, —logPx{x) is the surprisal of the event "X = x" and has the 
properties described above. On one hand. Shannon [Sha48] and later 
Renyi [Ren61] derived the mathematical form of the entropy, Eq. (1.1), 
from intuitive axioms that measures of the average surprisal should sat- 
isfy. On the other hand, the Shannon entropy is intimately related to 
physics and it was inspired directly by the Gibbs entropy of thermody- 
namics [Gib76, Bol72]. 

Next, we consider the case where we have side information about X, 
modeled as another random variable Y. The two random variables, X and 
Y, have a joint probability distribution, PxY{x,y)- The marginal proba- 
bility distributions of the individual random variables X and Y are thus 
given as Px{x) = J2yPxYix,y) and Py(y) = ^xPxYix,y), respectively. 
We employ the conditional Shannon entropy of X conditioned on Y, which 
is defined as H{X\Y)p := H[XY)p — H(Y)p. This definition, aside from 



^The most frequent letters 'e' and 't' are given the codes ' ■ ' and ' — ', respectively, 
whereas all other letters have codes with at least two symbols. 
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its operational interpretation that we wih discuss below, has very natural 
properties that we would expect from a conditional entropy, i.e. a measure 
about the average surprisal or uncertainty of X given side information 
Y. For example, we have H{X\Y)p < H{X)p, namely, the uncertainty 
about X increases if one ignores the side information. Equality holds if 
the side information is independent of X, i.e. if Pxy{x, y) = Px{x)PY{y)-^ 
Relations of this type are called data processing inequalities and will be 
discussed in Chapter 5. 

Conditional entropies can be employed to model subjective classical 
information.^ Consider the example of an unbiased die that is thrown 
secretly in a cup. The state of the die before the result is revealed is mod- 
eled as a random variable, X, on the set {B, □, 0, S, (S), 0} of faces with 
probability distribution Px{') = «• The Shannon entropy or uncertainty 
about the outcome is simply H{X)p = log 6 « 2.58. However, an ob- 
server may own a futuristic device, Y , that correctly predicts whether the 
value is even or odd with probability | from the sound the die emits when 
thrown. The correlations between the result of the throw and this device 
can be described by a joint probability distribution Pxy satisfying, e.g., 
Pxy(H, "odd") = I • f = I and Pxy(0, "even") = i • i = ^. The uncer- 
tainty this observer has about the outcome is reduced to H{X\Y)p w 2.39. 
Another observer may own an even more powerful device, Z, that predicts 
the value of X with certainty; hence, H{X\Z)p = 0. The subjective un- 
certainty an observer has about the outcome of the throw thus critically 
depends on the available side information^ Y or Z. 

The Shannon entropy has found a vast number of applications in in- 
formation theory. Shannon's source coding theorem [Sha48] concerns itself 
with the question of how much we can compress the output of a source. If 
a source produces a long stream of independent and identically- distributed 
(i.i.d.) symbols X, it states that any attempt to compress the output to less 
than H{X)p logical bits per symbol will almost certainly lead to informa- 
tion loss. Here, Px is the probability distribution of the source symbols. 
On the other hand, it is possible to compress the stream to arbitrarily 
close to H{X)p bits per symbol with insignificant probability of informa- 



^More generally, and in accordance with our intuition, any function applied to the 
random variable Y can at most increase the uncertainty about X, i.e. H{X\Y)p < 
H{X\Z)p \i Z ^ f{Y). The entropy H{X\Z)p is evaluated for the probability distribu- 
tion induced by /, i.e. the distribution Pxz(x, z) = X^ .t, w^ Pxy{x, y). 

^For another example, we return to Shannon's text sources and treat them quan- 
titatively. Let us first consider a source that outputs all 26 letters of the English al- 
phabet with equal probability. Clearly, the entropy of the output U of this source is 
H{U) = log 26 ~ 4.7. A source that outputs letters of English text, X, has lower 
entropy — H{X) ~ 4.14 [Slia51] — due to the non-uniform distribution of the different 
letters. Moreover, if we consider two consecutive letters of English text, denoted X\ and 
X2, we find that the conditional entropy — H{X2\X\) ~ 3.56 [Sha51] — is even smaller 
due to correlations between the probabilities of adjacent letters. 
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tion loss. More generally, if some additional side information Y about each 
X is available, the quantity H{X)p can be replaced by H{X\Y)p in the 
above statements [SW73], where Pxy is the joint probability distribution 
of source and side information. The Shannon entropy thus quantifies the 
amount of memory (in bits) needed to store the output of a source reli- 
ably. Memory can be considered a resource in source coding and, more 
generally, the Shannon entropy is often used to analyze the resource usage 
of a task in the limit of many independent and identical repetitions. In 
the following, we call this the i.i.d. limit 

As a further important example, the capacity of a channel to transmit 
information (in the i.i.d. limit of many independent uses of a memoryless 
channel) can be expressed in terms of Shannon entropies [Sha48] . 

1.3 Quantum Information Theory 

An observation about a physical system, in the sense described above, 
constitutes an event. Moreover, a measurement, i.e. a complete set of mu- 
tually exclusive observations about a physical system, can be seen as an 
information source. It is thus very natural to try to model quantum mea- 
surements using the entropy formalism of the previous section. Moreover, 
many information theoretic tasks (we will discuss the example of source 
compression in detail) can be generalized to the quantum setting and thus 
the question arises whether the resource usage in these tasks can charac- 
terized using an analogue of the Shannon entropy as well. We will see that 
this is indeed possible. 

In the quantum formalism, to be formally introduced in Chapter 2, a 
random variable is modeled as a (classical) register, X and the probability 
distribution, Px, is represented as a diagonal matrix with the probabili- 
ties i-'x(x) as entries. This is a special case of a quantum system. More 
generally, the state of a quantum system. A, is modeled as a positive semi- 
definite operator with unit trace, pA, called the density operator or quan- 
tum state. Schumacher and Wootters [Sch95] introduced the term qubit 
to denotes the smallest unit of quantum information. The spin degree of 
freedom of an electron, as we have seen before, constitutes a physical ex- 
ample of such a qubit system. Holevo's bound [Hol73] implies that a qubit 
can only store one bit of classical information. This indicates that the 
power of quantum information over classical information lies in the possi- 
ble correlations between multiple quantum systems and not necessarily in 
the quantum systems themselves. 

The von Neumann entropy [vN32] of a quantum system is given as 

H{A)p := -logtr(/)^logpA), 
where tr denotes the trace. For a register X this expression reduces to 
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the Shannon entropy. A measurement, in this formahsm, is simply a map 
from a quantum system to a register. 

To see a first application of the von Neumann entropy in quantum in- 
formation theory, we consider a simple uncertainty relation. For example, 
let A be qubit, for example an electron spin in a state Pa and let X be 
a register containing the outcome of either one of two incompatible mea- 
surements that are chosen at random, for example spin measurements in 
the X or y direction. Then, an entropic formulation of Heisenberg's uncer- 
tainty principle tells us that H{X)p > j- [MU88]. Hence, independently 
of how the initial state Pa is prepared (as long as there is no quantum side 
information present), there will be uncertainty about at least one of the 
outcomes. Such uncertainty relations, expressed in terms of entropies, are 
the topic of Chapter 7. 

In order to investigate the effect of quantum correlations, we introduce 
the conditional von Neumann entropy. For a bipartite quantum state Pab, 
this is given as H(A\B)p := H{AB)p — H{B)p. If the system A is a classi- 
cal register, this entropy is nonnegative and can safely be interpreted as a 
measure of uncertainty given quantum side information. More explicitely, 
we reconsider the example where two electrons, A and i3, are in a spin 
singlet state. As we have noted before, quantum mechanics predicts that 
a preferred observer controlling the electron B can predict any measure- 
ment outcome on the A electron with certainty. Thus, the surprisal of any 
observation about electron A is zero for this observer. More formally, we 
denote by X the random variable that stores the outcome of an arbitrary 
spin measurement on the electron A. Then, the conditional von Neumann 
entropy evaluates to H{X\B)p = 0. Here, the entropy is evaluated for the 
post-measurement state pxB that results from measuring X on the joint 
quantum state Pab- On the other hand, all other observers will see a uni- 
formly random measurement outcome and thus have maximum surprisal. 
We denote by C such an ignorant observer. And indeed, for any tripartite 
quantum state Pabc that is compatible with the marginal state Pab , we find 
H{X\C)p = 1. Again, the entropy is evaluated for the post-measurement 
state pxc that results from measuring X on the joint state Pac- 

However, the conditional von Neumann entropy can also be evaluated 
for the quantum state Pab of the electron pair before measurement. This 
entropy evaluates to H{A\B)p = —1 in the example above. Generally, the 
conditional entropy can be negative in the presence of entanglem,ent and 
its interpretation as a measure of uncertainty has been controversial. (See, 
e.g., [HOW05, BCC+10, dRAR+11] for recent work elucidating the issue.) 

Generalizing the source coding theorem with side information of the 
previous section, we ask how much we can compress a long stream of i. i. d. 
random variables X such that an observer with quantum side information 
B about each X can reconstruct the original from the compressed stream. 
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Such side information can be modeled in a joint quantum state pxs- The 
answer was given by Devetak and Winter [DW03], who propose a quantum 
generahzation of the Slepian-Wolf theorem. They show that, consistent 
with the classical result, the stream can be compressed to H{X\B)p bits 
per symbol in the i.i.d. limit of long streams. 

In another generalization of source coding, we ask how much we can 
compress a long stream of independent and identical quantum systems A 
that are in the state Pa- The answer, that such a string can be compressed 
to H{A)p qubits, was given by Schumacher [Sch95] in his pioneering work 
on quantum information theory. 

We can go one step further into the quantum world and consider state 
merging [HOW06]. Here, we start with joint quantum state Pab shared 
between two parties, A and B. The task is to recreate the state (including 
its correlations with the environment) at B using free classical communica- 
tion between A and B and entanglement between A and B a,s a. resource. 
The amount of this resource that needs to be utilized (in the i.i.d. limit 
of many repetitions of the task) is then quantified by the conditional von 
Neumann entropy, H{A\B)p. The conditional entropy can be consistently 
interpreted as the amount of entanglement needed to complete the task. 
In particular, if H{A\B)p is negative (which may happen only in the pres- 
ence of entanglement in Pab), it is possible to extract —H{A\B)p units of 
entanglement from the state while recreating it at B. 

1.4 Non- Asymptotic Information Theory 

So far, we have considered tasks in the i.i.d. limit of many repetitions and 
found that the resource usage is characterized by expressions involving the 
Shannon and von Neumann entropies. This leads to the question of what 
happens to the resource usage when we consider a finite number of trials 
and allow a small probability of failure. In many context, in particular in 
cryptography, we are even interested in the amount of resource needed to 
perform a task just once, with high probability of success. We call this the 
one-shot setting in the following. These questions, which we are going to 
tackle in the quantum setting, have also been the topic of recent research 
in classical information theory (see, e.g., [PPVIO, PoUO, WCR09]). 

It turns out that the Shannon entropy and von Neumann entropies, 
which have been invaluable tools in the previous sections, are insufficient 
to characterize the required resources in the one-shot setting. Nevertheless, 
there are other entropies that can be used instead to quantify uncertainty, 
as we will see in the following. 

We will now introduce some aspects of non-asymptotic information 
theory on the example of source compression. For this purpose, let us 
consider a source X with probability distribution Px ■ The number of bits 
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of memory needed to store the output of this source so that it can be 
recovered with certainty is given by \Hq{X)p'] , where Hq{X)p denotes the 
Hartley entropy [Har28] of the distribution Px, defined as 

HoiX)p ■.= log\{x : Pxix) > 0}\, 

The Hartely entropy corresponds to the Renyi entropy of order [RcnGl] 
and simply measures the size of the support oi X. As an example, we 
consider again a source that outputs characters of the English alphabet. If 
we want to store a single character produced by this source such that it can 
be recovered with certainty, we clearly need [log 26] = 5 bits of memory 
as a resource. This first result is rather unsatisfactory since the resource 
usage does not depend on the actual probability distribution of the letters 
but only on the size of the alphabet. 

More interestingly, we may ask how much memory we need to store the 
output if we allow a small probability of failure, e. One way to tackle such 
problems is by investigating encoders that assign code words (i.e. binary 
strings) of a fixed length m (in bits) to the events a source produces. 
These code words are then stored in m bits of memory and a decoder is 
later used to compute an estimate of X from that memory. For a source X 
with probability distribution Px, we are thus interested in the minimum 
code length, rrf{X)p, for which there exists an encoder and a decoder that 
achieve a probability of failure not exceeding e. Gallager [Gal79] showed 
that a random assignment of source events to code words on average leads 
to a failure probability of at most e if the code length is sufficiently long. 
His results imply that the minimal code length satisfies 

m'{X)p < H^,,{X)p + log ^ + 1. (1.2) 

Here, H^^^{X)p denotes the max-entropy, which corresponds to the Renyi 
entropy of order 2 and is defined as 

Upper bounds of the type (1.2) are called direct bounds and show that 
there exist protocols using a certain amount of resource that do not exceed 
a fixed probability of failure. 

This analysis can be further refined by smoothing the max-entropy. For 
this purpose, let us consider probability distributions Qx that are close to 
Px and have max-entropy H^^{X)q smaller than H^g_^{X)p. In principle, 
we could design the encoding and decoding scheme for a source with distri- 
bution Qx instead of Px- Clearly, this reduces the upper bound in (1.2). In 
fact, the upper bound can be expressed in terms of a smooth max-entropy, 
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H^^^{X)p := infQ~pi?^^^3^(X)(5, which minimizes the max-entropy over 
probabiUty distributions Qx that are within statistical distance ei of Px-^ 
More precisely, they satisfy D{Px,Qx) '■= 5 J2x \Px{x) — Qx{x)\ < ei- 

However, if this scheme is apphed to the original source, we incur an 
additional error that depends on the statistical distance between Px and 
Qx- The total probability of failure, e < ei + £2; is thus split into two 
contributions: the statistical distance of the distributions, D{Px,Qx) < 
ei, and the contribution from the Gallager bound, which we denote £2- 
Hence, we get improved direct bounds on the minimal code length: 

m%X)p <H^^^^{X)p + log— + 1, V£i,e2 with £1 + 62 = £. (1.3) 

We also consider converse bounds that give a lower bound on the re- 
sources required to achieve a certain probability of success. In fact, it can 
be shown that 

m%X)p > H^,^{X)p. (1.4) 

Hence, both the lower and upper bound on the quantity m^{X)p can be 
expressed in terms of a smooth max-entropy. We thus say that the required 
memory for one-shot source compression is characterized by the smooth 
max-entropy. (Note that a more detailed analysis of source compression 
with quantum side information can be found in Chapter 8 and [RRIO].) 

To see why the Shannon entropy does not suffice to characterize the 
one-shot version of source compression, consider a source that produces the 
symbol 'a' with probability ^ and k other symbols each with probability 
2^. On one hand, for any fixed failure probability, the converse bound 
in (1.4) evaluates to approximately log A; for large enough k. This implies 
that we cannot compress this source much beyond the Hartley entropy. 
On the other hand, the Shannon entropy of this distribution is ^(log k-\-2) 
and underestimates the required resources by a factor two. 

In this thesis, we will mainly encounter two entropic quantities that are 
defined for quantum states in Chapters 4 and 5. Surprisingly, it turns out 
that these two entropies suffice to characterize the resource usage of many 
information theoretic tasks in the one-shot setting in a manner similar to 
source compression. The first quantity, the smooth min-entropy , -f^min' i^ 
a generalization of the min-entropy or Renyi entropy of order 00, which 
evaluates the minimum surprisal of a random variable X. Namely, 

Hmm{X)p ■= min-logPx(a:), 

x£X 



^Note that we use the statistical distance as a metric here for convenience of expo- 
sition. In Chapter 3, we will argue that the purified distance should be used instead to 
define the smooth entropies in the quantum setting. All smooth entropies used in this 
thesis, except in this introductory exposition, are thus based on the purified distance as 
a metric. 
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One of the major applications of the smooth min-entropy is in randomness 
extraction, where it characterizes the amount of uniform randomness that 
can be extracted from a biased source [ILL89]. The second quantity, the 
smooth max-entropy, H^^^, is used to characterize source compression, as 
we have seen above. 

The smooth min- and max-entropies were first generalized to the quan- 
tum settings by Renner and Konig [RK05, Rcn05]. The smooth min- 
entropy we use in this thesis is a refined version of the smooth min-entropy 
proposed in [Rcn05], whereas the smooth max-entropy is based on later 
work by Renner, Konig and Schaffner [KRS09] . The significance of these 
extensions to the quantum setting stems from their operational meaning. 
For example, the quantum generalization of the min-entropy, Hf^^^^{X\B)p, 
characterizes randomness extraction against quantum side information B, 
i.e. it characterizes the amount of uniform randomness, independent of 
the side information B, that can be extracted from X [Rcn05, TSSRll]. 
The quantum generalization of the max-entropy, H^g^^{X\B)p, character- 
izes source compression with quantum side information B in the one-shot 
setting [RRIO]. Moreover, the amount of entanglement needed in one-shot 
state merging is characterized by H^^^{A\B)p [BcrOS]. We will discuss 
source compression with quantum side information as well as randomness 
extraction against quantum side information in Chapter 8. 

In addition to their operational meaning, the smooth entropies exhibit 
many useful properties, including data processing inequalities and chain 
rules. Furthermore, the smooth min- and max-entropies converge to the 
von Neumann entropy in the i.i.d. limit. For any < e < 1, 

lim -H^^,jX-\B-)p= lim -H^^,jX-\B-)p = H{X\B). 

n— s>oo n n^-oo n 

We call this the entropic form of the asymptotic equipartition property and 
it is the topic of Chapter 6. 

This means that if the resource usage is characterized by a smooth 
entropy in the one-shot setting, the resource usage in the i.i.d. limit is 
given by the von Neumann entropy. In fact, a simple analysis shows the 
upper and lower bounds in (1.3) and (1.4) on the code rate, r = ^m^(X"), 
converge to the von Neumann limit when we let n go to infinity for any 
fixed < e < 1. This also shows the advantage of the smoothed direct 
bound (1.3) over the Gallager bound (1.2), for which such an asymptotic 
convergence can only be shown using additional techniques. 

The smooth entropies satisfy a duality relation. For any tripartite 
quantum state Pabc and any < e < 1, we find [KRS09, TCRIO] 

H^,^{A\B)p > -Hf^^JA\C)p (1.5) 

and equality holds if the joint state Pabc is pure.^ This relation provides 



^Pure states offer the most complete description of a joint quantum system. 



17 



1. INTRODUCTION 



a connection between the min- and max-entropy, and thus the tasks char- 
acterized by them, that does not exist in classical information theory. 

It also allows us to close the circle to the discussion of quantum mechan- 
ics at the beginning of this chapter. For this purpose, let B and C be two 
observers of a quantum system A. Then, the min-entropy, H^^^{A\B), can 
be viewed (cf. Chapter 4) as the distance of B to an omniscient observer 
of the quantum system A. Furthermore, the max-entropy, —H^^^{A\C), 
can be viewed as the distance of C to an ignorant observer of A. The 
duality relation, Eq. (1.5), thus states that if B is close to an omniscient 
observer of the quantum system A, then C is at least as close to an igno- 
rant observer of A. This can be seen as a manifestation of the subjective 
knowledge of observers quantum mechanics imposes on the world. 

1.5 Goal and Outline 

The goal of this thesis is to consolidate the smooth entropy framework for 
non-asymptotic information theory and to introduce important additions 
to the framework, including the entropic asymptotic equipartition property 
and various uncertainty relations. This work should provide a reference for 
researchers interested in the smooth entropy framework for non-asymptotic 
quantum information theory. The focus of this work is thus mainly on the 
properties of the smooth entropies and not on their applications. 

The remainder of this thesis is organized as follows. 

In Chapter 2, the notation and mathematical foundations of Hilbert 
space quantum mechanics are introduced. Relevant results of linear algebra 
are summarized in Section 2.1 and the axioms of quantum mechanics are 
introduced in 2.2. Moreover, Section 2.3 covers operator convex functions 
and semi-definite programming, completing a mathematical toolkit that 
will be used extensively in this thesis. 

In Chapter 3, we introduce a novel measure of distance between (po- 
tentially incomplete) quantum states, the purified distance. In particular, 
we explore its properties and argue why they are relevant for the definition 
of the quantum smooth entropies. 

In Chapter 4, we formally define the min- and max-entropies for quan- 
tum states, give a collection of different expressions for the entropies and 
explore some of their properties. In particular, we explore the relation be- 
tween the min- and the max-entropy and their relation to classical Renyi 
entropies. We also show that both entropies are continuos functions of the 
quantum state. 

In Chapter 5, we introduce smoothing and define the smooth min- and 
max-entropy. Various properties of the smoothing operation are discussed. 
We establish relations between the smooth min- and max-entropy and 
investigate data processing inequalities. The special case where one or 
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more systems are classical is considered in detail. We also give a list of 
chain rules that have recently been shown for the smooth min- and max- 
entropies. 

In Chapter 6, we show that the smooth entropies converge to the von 
Neumann entropy when we consider a sequence of independent and iden- 
tically distributed quantum systems. This result is an entropic version of 
the asymptotic equipartition property and confirms the fundamental role 
of the von Neumann entropy in quantum information theory. In addition 
to the asymptotic result, the chapter also provides bounds for finite block 
lengths. 

In Chapter 7, we introduce a variety of entropic uncertainty relations 
that give bounds, analogous to Heisenberg's uncertainty principle, on the 
uncertainty of the outcomes of two incompatible measurements on a quan- 
tum system. These uncertainties are expressed in terms of smooth en- 
tropies as well as von Neumann entropies, which makes them directly ap- 
plicable to problems in quantum cryptography. 

In Chapter 8, we combine results from Chapters 3-6 to investigate 
two information theoretic tasks, source compression with quantum side 
information and randomness extraction against quantum side information. 
In particular, we show a strong converse statement for source compression 
which implies that any attempt to compress to less than the Shannon limit 
will fail with high probability. These results can then be used in another 
example application, quantum key distribution. There, we show how the 
entropic uncertainty relations of Chapter 7 can be employed in order to 
prove security of the original BB84 quantum key distribution protocol in 
a concise and intuitive way. 

The thesis ends with Chapter 9 in a short conclusion and outlook. 
Some open problems are discussed. 
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Chapter 2 

Preliminaries 



The preliminaries consist of three sections. The first two sections, which 
cover the mathematical foundations of Hilbert space quantum mechanics 
in finite dimensions, can be skipped entirely by readers already familiar 
with the concepts of linear algebra used in quantum information theory. 
The notation used throughout this thesis is introduced in these sections 
and summarized in Tables 3 and 2. The last section then introduces some 
mathematical tools that are needed to derive the results of the follow- 
ing chapters, including semi-definite programming and operator monotone 
functions. 

2.1 Linear Algebra on Hilbert Spaces 

This section is based on many introductory text books, mostly on the two 
books Matrix Analysis and Positive Definite Matrices by Bhatia [Bha97, 
Bha07] . Moreover, John Watrous's lecture notes [Wat08] as well as Nielsen 
and Chuang's Quantum Computation and Quantum Information [NCOO] 
were an invaluable resource. 

2.1.1 Hilbert Spaces 

Bras and Kets 

Let ^ be a finite-dimensional vector space over the complex numbers 
equipped with an inner product (•,•): 3f x J{ —;• C. In the following, 
we will call !K a Hilbert space. The dual space of 'K is the Hilbert space 
of (linear) functionals from "K to C. We use Dirac's bra-ket notation to 
denote elements of 'K and its dual space, 'K* . Every ket, {tp) £ "K, is in 
one-to-one correspondence with its dual bra, (■(/'I G ^*. The bra is defined 
in terms of the ket via the Hilbert space's inner product as 

{i;\:\^)^{i^\ip):={\i;),\^))- (2.1) 



21 



2. PRELIMINARIES 



The right-hand side of the above equation gives a natural expression 
for the inner product in terms of a bra-ket product, and we wih use this 
notation frequently. The bra-ket product (•!•) in (2.1) has the following 
properties, which follow directly from properties of the underlying inner 
product. 



• Conjugate symmetry: (V^lv^) = (v'lV')- 

• Sesquilinearity: Let If^i), |'i?2) G ^ and ai,a2 G C such that \ip) = 
oil^i) + «2|f?2)- Then, {^p\ip) = ai('0|'!9i) + 02(^1^2) and, due to 
conjugate symmetry, {^p\^p) = ai{'di\tp) + Q;2(??2|V')- 

• Positive-definiteness: {il^\ip) > with equality if and only if \ip) = 0, 
where is the zero element of the vector space. 

Norms and Metrics 

The usefulness of Hilbert spaces in physics stems in part from the fact 
that they have a natural measures of angle and distance through the inner 
product. In fact, the inner product of a Hilbert space 5C induces a norm, 
II -f : IK — )• M, which in turn induces a metric, D{-, •) : ?{ x ?{ — >• M. They 
are given by the expressions 

|||V^)|| :=7(V#) and D{\i;),\^)) := \\\i;) - \^)\\. 

More generally — and for later reference — a metric on an arbitrary set 
is defined as follows. 

Definition 2.1 (Metric). Let X be a set and let a,b,c G X. Then, the 
functional D:A'xA'— T-Misa metric on X if it satisfies 

1. Positive-definiteness: D{a,b) > with equality if and only if a = b. 

2. Symmetry: D{a,b) = D{b,a). 

3. Triangle inequality: D{a,c) < D{a,b) + D{b,c). 

It is easy to verify that the induced metric introduced above fulfills these 
conditions. 

Orthonormal Bases 

Let S be a set of kets |i?i), \'&2), ■ ■ ■ , \'&n) G ^- The linear span of S is the 
subspace of !K containing all linear combinations these kets, 

n 

span{S} = span{|^i), |i?2), • • • , \^n)} ■= { ^a^l^*) : «i e c}. 
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We call 'B a basis oCK if span {23} = Ji. Furthermore, a basis {|ej)}f^]^ := 
{|ei), |e2), . . . , \ed)} is called orthonormal if its elements are mutually or- 
thonormal. Formally, this means that (ej|ej) = Sij, where the indicator 
function is given by 6ij = 1 if i = j and otherwise. Every orthonormal 
basis has exactly d elements, where d := dim {IK} is the Hilbert dimension 
of "K. A ket \ip) € "K has a unique decomposition into any orthonormal 
basis {|ei)}, where it can be represented as a d x 1 column vector. 



d 

{eiW) lei) ~ 






(e2|V'> 



V {earn / 
Similarly, a bra {ip\ £ "K* can be represented as a 1 x d row vector. 

d 

{ij\=Y^{ij\ei){e,\^{{Mei) (^jea) ... (^|e,) ) . 

Direct Sum Spaces 

Given two Hilbert spaces "K and IK', we introduce the direct sum Hilbert 
space of "K and IK', denoted IK © IK'. This space consists of linear combi- 
nations of tuples in IK X IK', which we denote by \ip) ® \ip'), where \tp) £'}i 
and IV'') G IK'. The direct sum space is motivated via its inner product, 
which we want to be a sesquiliniar extension of the relation 



for any two tuples |V') ® W)-, W) © W) G IK © IK'. This inner product is 
positive-definite if and only if cx{\ip) © \tp')) + ](/?) © \(p') = (ajV') + W)) © 
(ya\ip') + W)) for any a G C. This constitutes the rule for multiplication 
by a scalar and addition of elements in IK © IK' and we may thus write 

IK © IK' := span { jV') © W) : |V'> G ^, IV'') G IK'} . 

An orthonormal basis of this space is given by {|ej) © 0} U {0 © |e')}, 
where {|ej)} is an orthonormal basis of IK and {|e')} is an orthonormal 
basis of IK'. Hence, dim {IK © IK'} = d + d' , where d and d' are the Hilbert 
dimensions of IK and IK', respectively. 

Tensor Product Spaces 

Given two Hilbert spaces IK and IK', we define the tensor product Hilbert 
space of IK and IK', denoted IK © IK'. The space consists of linear combina- 
tions of tuples (pure tensors) in IK x IK', denoted \ip) © |V''), where iV') G IK 
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and \tp') € "K' . Again, we motivate the tensor product space via its inner 
product, which we want to be a sesquihniar extension of the relation 



b\®{^'l\^)®y)) = {^\^){^'y). 

for any two tuples \ip) ® |V'')) W) ® W) G ^ ® !K'. This inner product is 
positive-definite if and only if the following relations are satisfied. For any 
a G C, we need 

a{\^) ® W)) = {c^m ® W) = \^) » {aW)) , 

1^) ® W) + 1^) ® W) = IV-) ® {W) + W)) and 
1^) ® W) + W) ® W) = (1^) + W)) ® W) . 

The tensor product Hilbert space is then defined as the vector space built 
from linear combinations of all pure tensors modulus the above equivalence 
relations. Moreover, if IK has an orthonormal basis {|ej)}i and !K' has an 
orthonormal basis {|e')}j, then the pure tensors |ej) (8) |e') ^'K(i^'K' form 
an orthonormal basis of !K (g) IK' and dim {IK <S> IK'} = d ■ d' . 

2.1.2 Operators on Hilbert Spaces 

Linear Operators 

We denote the set of linear operators from IK to IK' (vector space ho- 
momorphisms) by i2(IK, IK'). Every operator L E i2(IK, IK') has a unique 
decomposition into any pair of orthonormal bases, {|ei)} of IK and {|e')} 
of IK'. The operator can be represented as a d' x d matrix in these bases. 



L = ^(e^-|L|ej) |e^-)(ej 



i,j 



( [L]ii [L]i2 [L]i3 ••• [L]id \ 

\L\2\ [-L]22 [-^]23 • : 



(2.2) 



where [L\ji := (e'|L|ej). Note that representations of kets, bras and op- 
erators in a particular basis are only used as illustrations throughout this 
thesis. 

For every L E £(IK, IK'), we define its adjoint operator L"^ E £(IK',IK) 
as the unique operator that satisfies 



{ip\L\^) = {ip\L^\ij) for ah \<f) E IK and IV') E IK' . 

(This is equivalent to the condition (1-0), i|<^)) = (L"''|0), |c^)) expressed 
in terms of the inner product.) In particular, this definition, together 
with (2.1), implies that a ket L\lp) has the dual bra {^\L\ 
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Moreover, we will also need the transpose L^ of L with regards to a 
pair of bases, {|ej)} of ^ and {|e')} of "K' . This is defined as 



L-:=J](e;.|L|e.)|e.)(e;.| 



The support of an operator L G £(?{, 'K') is the subspace of !K spanned 
by all kets that are not mapped to zero by L, that is 

suppjL} := span{|c/?) : \(p) £ "K and L\ip) / 0} . 

The rank of L is the dimension of its support, rankjL} := dim{supp{L}}. 
In contrast, the kernel of L is the subspace orthogonal to the support of 
L, namely 

kernjL} := spanjl^^) : If) G 'K and L\{p) = 0} = J{/supp{L} . 

Finally, the image of L is the subspace of Ji' spanned by L, that is 

image {L} := spanjLl^?) : \(p) G Ji} . 

Projectors, Identity and Inverse 

The linear operators from 'K onto itself (vector space endomorphisms) are 
denoted C{'K) := C{'K,'K). A projector into a subspace 3i' of "K is an 
operator P G C^K) with supp {P} = image {P} = "K' that acts as an 
identity on aU \ip') G Ji', i.e. P\ip') = \tp'). Thus, P^ ^ P and P = pt. 
We use the symbol 1 to denote the identity operator on Ji, which is the 
projector from 'K onto ^. The identity operator can be decomposed in any 
orthonormal basis {|ej)} of 5C as 1 = Yli |ei)(ei|- -l^or any linear operator 
L G C^K), we denote the projector onto its support by 11 . 

For operators L G C^K), we define their inverse (if it exists), L^^ G 
C{!K), as the unique operator satisfying L~^L = LL'^ = 1. We will often 
use a generalized inverse, which is defined for every operator L G CCK, "K') 
and is the inverse of L on its support. This means, we define the generalized 
inverse L^^ as the unique operator satisfying L^^L = U and supp {L^^} = 
image {L}. 

Isomorphisms, Unitaries and Isometries 

An isomorphism is a linear bijective map associating elements of two struc- 
tured sets in a way that preserves that structure. In particular, an isomor- 
phism u: "K ^ '}i' between Hilbert spaces !H and 'Ji' preserves the inner 
product and, thus, satisfies 

<|V^),|9.)> = (n(|^)),n(|v.))) (2.3) 
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for all |^),|(/?) G 'K. We call two Hilbert spaces !K and "K' isomorphic, 
denoted Ji = 'Ji' , if there exists an isomorphism between 'Ji and Ji' . Two 
Hilbert spaces are isomorphic if and only if they have the same Hilbert 
dimension. (To see this, note that an isomorphism can always be seen as 
a bijective map between elements of two orthonormal bases.) 

A unitary operator is an operator U E £{%) that is an isomorphism 
between "K and itself. Since an isomorphism has to satisfy (2.3), we have 
{il)\ip) = {il)\U^U\tp) or, equivalently, U^U = 1. Hence, the inverse of a 
unitary operator U is well defined and U^^ = W . 

A partial isometry is an operator V £ £(!K, J{') that is an isomorphism 
between supp{y} C Ji and image {y} C Ji' . A partial isometry satisfies 
V'V = n and, thus, V'^ = V' is its generalized inverse (and a partial 
isometry too). An isometry is a partial isometry with full support on Ji. 

In the following, we denote the set of unitary operators on a Hilbert 
space 'K as l/({^i) and the set of partial isometries from Ji to 'K' as 
UCK, '}{'). Isometries can always be understood as embeddings. Let !K 
and '}{' be two Hilbert spaces such that dim{!K} < dim{5{'} and let 
V € U{'K,'J{') be an isometry that embeds Ji into Ji' , i.e. it satisfies 
V'V = 1 on !K. Then, for every operator L G C{!K), we implicitly define 
its embedding L' := VLV^ G £{Ji'). 

Trace 

For any Hilbert space CK, the trace of an operator in £(!K) is the (linear) 
functional tr : C{!K) — )• C with the defining properties 

tT{AB) = tT{BA) and tr(l) = d (2.4) 

for all linear operators A £ £(;K, Ji') and B G C{'K\ "K). In particular the 
trace is invariant under unitary conjugation, tiiJJ^ L U) = tr(L) for any 
U G U{'K) and L G C{'K). This implies that there exists a representation 
of the trace as a functional on matrix representations that is independent 
of the choice of basis used for the representation. 

If we choose an orthonormal basis {|ei)} for "K and {|e')} for "K', the 
operators AB and BA can be represented as matrices with entries 

[AB],, = Y,{ej\A\eu){eu\B\e[) and [i?A],, = ^(e',|A|e,)(e,|i?|e',) 
k k 

and it is easy to verify that the only functional that satisfies (2.4) is the sum 
of the diagonal elements, XU^-^]** = SJ-^^]**- Hence, for any L G C{%) 
and any basis {|ei)} of "K, we have tr(L) = X^jWii — X]i(^«l-^ki)' which 
is how the trace of a matrix is commonly defined. 
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Singular Values 

For any operator L G £(J{,;K'), there always exists a pair of bases {|ej)} 
of !K and {|e^)} of ^' sucii that L can be decomposed as 

L = ^Sj |e-)(ei| and si > S2 > • • • > Sm > , (2.5) 



where m is the rank of L. This is called the singular value decomposition 
and the unique positive Si = Si{L) are called singular values. The singular 
values are invariant under unitary rotations, as these operations can be 
absorbed into the basis. Thus, for any U G U{'K) and V G U{Ji'), 

Si{V'LU) = 8i{L). (2.6) 

Support and image of L can be expressed in terms of the two bases, 
that is suppjL} = span{|ej)} and image {L} = span{|e^)}. 

Note that the operator L^ L = X^j Sj^ |ej)(ej| G C{'K) has a unique 
positive square root, namely the modulus of L, 



\L\ := \/Ul = ^ Si\ei){ei 



Comparing this with the singular value decomposition (2.5), we find the 
polar decomposition, L = W\L\, where W : |ej) i— )■ \e[) is a partial isometry 
from IK to ^i' defined through the bases of the singular value decomposition 
of L. 

Hilbert-Schmidt Inner Product and Schmidt Decomposition 

Given two Hilbert spaces ^ with orthonormal basis 23 = {|ej)} and 'K' , 
we associate operators in C{'K,'J{') with tensors in "K "K' through the 
isomorphism vec^ : L i— )• ^^ |ej) (8) L\ei). Using the decomposition of L 
in (2.2), we find that vec(-L) can be written as 

vec3(L) = ^ [L]j^i \ei) ® |e^-) , 
id 

which simply corresponds to the rearrangement of the matrix entries of L 
(in the given bases) as a column vector. 

This isomorphism is useful because it induces an inner product on linear 
operators. We define the Hilbert-Schmidt inner product on the complex 
Hilbert space £(!K, IK') as 

{A,B) := (vecs(^),vecs(S)> = Y.{ej\e^){ej\A^ B\e^) = ^.i^M^ B\e^), 

id i 
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for any A,B G C{'J{, !K'). This expression is equal to the trace and, thus, 
independent of the basis S chosen for the isomorphism: 

{A,B)=tiiA^B). (2.7) 

The tensor vec:B(l) = J2i \^i) '^ \^i) takes a special role in the analysis 
of quantum systems. In particular, we will need the following property. 

Lemma 2.1 (Mirror Lemma). Let Ji be a Hilbert space with orthonormal 
basis "B and L G C{J{), then (l(g)L) vec3(l) = {L'^ 1) vecs(l), where the 
transpose is taken with regards to "B. 

Proof. By inspection. D 

This, together with the above isomorphism, can be used to prove the 
existence of the Schmidt decomposition. 

Lemma 2.2 (Schmidt Decomposition). Let "K, "K' he Hilbert spaces and 
let \6) G IK (8) 'K' . Then, there exist orthonormal bases {\'&i)} of "K and 
iWi)} of "K' as well as non-negative numbers Sj such that 

\e) = Y,s,m0\^',). 

i 

Proof Given a linear operator L £ C{'K, IK') and an orthonormal basis 
"B = {|ej)}, we use the singular value decomposition of L to get 

vecs(L)=vecs(yt5C/) = ^Si(L) U^ \ei) V^ {a) , (2.8) 

i 

where S = Yli^ii^) \^i){^i\ is diagonal in 23. The isometrics U G ly({!K) 
and V G U{^' ,'K) map the bases of the singular value decomposition to 
S. Hence, every tensor can be written in the form (2.8), which concludes 
the proof. D 

Operator Norms 

The singular value decomposition allows the definition of various norms on 
£(IK, IK'). In particular, we will often use the Schatten norms. For p > 1, 
they are defined as (cf. e.g. [Bha97, Wat08]) 



, m X i 

L\\,:={Y,s^{LYy. 



In particular, we will often use the infinity norm, ||-||oo) which is equal to 
the induced norm of the underlying Hilbert space norm, that is 

i-^ioo = si{L) = sup 
\-4>)<^-K 
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The 1-norm is equal to the trace norm, 

m 

ll^l|l = X]^*(^)=*''l^l- 
i=l 

And, finally, the 2-norni is equal to the induced norm of the Hilbert- 
Schmidt inner product, 



. m ^ 1/2 

^ 1=1 ^ 



These three norms satisfy ||L||oo ^ i-^lb ^ ll-^lli- And, thanks to (2.6), 
they are invariant under unitary rotations. Hence, |||L||| = |||yL J7|||, where 
III -III denotes any of the Schatten norms introduced above. For any three 
operators L,M,N E C^K), these norms satisfy 

|||MLA||| < ||M|U |||L||| ||A|U . (2.9) 

Moreover, these norms are sub-multiplicative, i.e. |||MA^||| < |||M||| |||A^|||. 

2.1.3 Positive Semi-Definite Operators 

Hermitian Operators 

An operator M E £(!K) is called self-adjoint or Hermitian if M^ = M. 
The set of Hermitian operators on 'K is denoted T-L{'K) := {M G C{'K) : 
M' = M}. Note that the real vector space ?^(!K) together with the Hilbert- 
Schmidt inner product in (2.7) form a real Hilbert space. 

Every M E T-L{'K) has an eigenvalue decomposition, namely 

M = ^Aj |ej)(ej| , where Aj E M and |Ai| > IA2I > . . • > |Am| > 0. 

i 

The coefficients Aj = Aj(M) are called eigenvalues of M and |ej) are eigen- 
vectors. Together, the eigenvectors form an eigenbasis {|ej)}, which is an 
orthonormal basis of suppjM} = image {M}. This eigenbasis is unique if 
and only if all the eigenvalues are mutually different. The singular values 
can be expressed in terms of the eigenvalues, that is Si{M) = |Ai(M)|.^ 

The eigenvalue decomposition is particularly useful to define the ac- 
tion of functions on operators. Note, for example, that M^ = MM = 



^The existence of the eigenvalue decomposition follows, for example, from the 
fact that the singular value decompositions of A4^ M = ^iSi'^\ei){ei\ and MM'' = 
X^i ^i^\^'i){^i\ have to be the same. Thus, if all singular values are mutually different, we 
have \ei) = e"^|ei) and the eigenvalue decomposition is of the form M = ^^ ±Si|ei)(ei|. 
More generally, an eigenvalue decomposition exists for all normal L £ C{'K), where 
normal means that L^ L — LL'' . 
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Y^i\'?\e,i){e-i\, \M\ = Y.i\M |ei)(ei| and M ^ = Y.i^\(^i){^i\- We gener- 
alize this and define the action of arbitrary functions / : M/{0} — t- M on 
Hermitian operators as 

/:M = ^A,|ei)(ei| ^ J^ /(A,) |e,)(e,|. 

i i 

Note that the function / only acts on the support of the Hermitian oper- 
ator, leaving its kernel intact. 

For any M G %{%), we denote by {M}_(. and {M}_ the projections of 
M onto its positive and negative eigenspace, respectively, that is 

{M}+= Y, ^i|ei>(ei| and {M}_ = ^ \i\ei){ei\ . 

i:Xi>0 i:Xi<0 

Positive Semi-Definite Operators 

The set of positive semi-definite operators, V{^) C 7i{'K) is the set of 
operators that have positive inner products with all vectors in 'K, namely 

V{Ji) := {M G n{:K) : (V'lM]^) > for aU |V) G ^} 
= {M e'Hi'K) :Xi{M) > Vi} . 

We write M > if and only if M G V{'K). In the following, we simply 
call these operators positive. Moreover, given two Hermitian operators 
M,N e'H{'K), we write M > N \i and only if M - A^ > and we write 
M < A^ if and only if A^ — M > 0. Note that this relation constitutes a 
partial order on 'H(3f). 

2.1.4 Operators on Tensor Spaces 

We will often use linear operators on tensor product spaces, for example 
£{% ®'K','k® Ji') = C{Ji, "k) (g) C{Ji', "k'), where the second isomorphic 
space is constructed from the Hilbert spaces £(IK,S) and C{'K','K') us- 
ing (??). Hence, we can write every element L G £(!K <S> "K' , 'K <^ !K') as a 
linear combination of the form 

L = ^ Xa/3 o-Q (Xi cr^j , where Xa/3 G C , (2.10) 

a,l3 

and {cTa} and {cr'o} are bases of C{!K,Ji) and £(?{', !K'), respectively. If 
L is Hermitian, then the Xa/3 can be chosen real and the bases can be 
chosen Hermitian. (This follows directly from the fact that T-L{'K) is a real 
Hilbert space.) Furthermore, this allows us to introduce a trivial extension 
of operators L G C{'K, k) to operators L (g) 1 in C{'K i^Ji' ^Ji® %.'). In 
the following, this extension is assumed implicitly whenever an operator 
defined on a subspace is applied to a tensor space. 
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2.1.5 Completely Positive Maps 

Super-Operators 

A super- operator is a linear map from linear operators on one Hilbert 
space to linear operators on another Hilbert space. For example, if the 
two Hilbert spaces are !K and !K', we denote the set of super-operators 
from C{!K) to C{!K') by C{C{'K),C{'K')). The super-operators form a vec- 
tor space and, since every operator in C{!K) can be decomposed according 
to (2.10), the action of a super-operator on an operator in a (larger) tensor 
space is well-defined via linearity. We use the symbol o to denote concate- 
nations of super-operators, for example (£ o 3")[-] = £[9'[-]]. 

For any super-operator £ G C{C{Ji) , C{!K')) , its adjoint super-operator, 
£^ G C{C{'K'),C{Ji)), is defined via the Hilbert-Schmidt inner product as 
the unique operator satisfying 

{s.[A],B) = {A,e.^[B]) for all A e C{:k),b e C{:k') . 

Completely Positive Maps 

super-operators that (consistently) map positive operators onto positive 
operators are called completely positive maps (CPMs). 

Definition 2.2 (Completely Positive Map). Let "K and 'Ji' he Hilbert 
spaces. A super- operator £ G C{C{'K),C{!K')) is called completely posi- 
tive, if, for any auxiliary Hilbert space "K" , it holds that 

£[M] > for all Af G P(:K (» "K") . 

An example of such a completely positive super-operator is the conju- 
gation with an operator L G C{'K,'K'), that is the map £ : M i— )• LMU. 
We will often use the following basic property of completely positive maps. 
Let £ G jC{C{!K),£,{'K')) be completely positive, then 

A>B=^ e,[A] > e,[B] for all A,B £ n{Ji) . (2.11) 

Furthermore, we take note of the following property of positive semi- 
definite operators. For any M, N G V{'K), we have 



tr(MiV) = tr(VMiVVM) > 0, 

where the last inequality follows from the fact that the conjugation with 
vM is a completely positive map. In particular, if A, y G 'H{^) satisfy 
A > y, we find tr(MA) > tr(My). 

A completely positive map £ is called trace preserving if tr(£[M]) = 
tr(M) for all M G C{^i). We denote the set of all trace preserving com- 
pletely positive maps (TP-CPMs) from C{Ji) to £(:K') by TiJi,Ji'). If £ 
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is a TP-CPM, then its adjoint £' is completely positive and unital. Since 

tr(M) = tr(8[M]) = tr (£"^[1]M) for all M £ C{Ji) , 

unital maps are defined by the property that they map the identity onto 
an identity, i.e. £' [1] = 1. 

Finally, a completely positive map £ is called trace non-increasing if 
tr(£[M]) < tr(M) for all M S P(!K). It is easy to verify (using the same 
argument as above) that its adjoint is completely positive and sub-unital, 
i.e. it satisfies £^[1] < 1. 

Partial Trace 

Given a bipartite Hilbert space 'Ji (^ 'Ji' , we are interested in the partial 
trace super-operator, denoted tr^^' G TCK g) 'K','K). The partial trace is 
defined as the adjoint super-operator to L i— )• L (g) 1, which maps operators 
L £ C^K) to C{!K (g) "K'). It is easy to verify, using (2.7), that the trace is 
the special case of the partial trace where "K = C 

To justify this definition, let us investigate the action of the partial 
trace tr^^' of a product state K L, where K E £(3f ) and L G C{Ji'), on 
an arbitrary state Q G C^K). We have 

(Q, tr„. {K(g)L)) = {K<g)L,Q^t^,) = {K, Q) tr(L) . 

Since this holds for all Q, we have trj^;'(i^ (g L) = tr(L)i^. 

The partial trace is cyclic in operators on the same Hilbert space and 
commutes with operators on other Hilbert spaces. Specifically, let L G 
C{Ji :K', :K (g) Ji") and K e C{Ji", Ji'). We use the decomposition L = 
Y^a sXapf^a <^ crL of (2.10). Since the partial trace is linear and K only 
acts on !K' , we have, using he cyclicity of the trace, 

tr„.(KL) = ^ti{Ka'f^) a^ = ^tria'/^K) aa = iT^"{LK) . 

a,(S a,/3 

Moreover, using this decomposition, it is easy to verify that ti^{KL) = 
KtTjt{L) and tr„(Li^) = tr j,{L)K. 

Choi-Jamiolkowski Isomorphism 

In the same spirit as the vec-isomorphism between operators £(!K, 'K') and 
tensors 'K(^'J{' , we now define the Choi-Jamiolkowski isomorphism [Jani72], 
Q, between super-operators in C{C{!K), C{^')) and operators in C{!K'(S''K). 

: £ ^ w^ = £ [|r)(r|] , where |r) = ^ \e^) la) G V{Ji » ^K) 
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and {|ej)} is an orthonormal basis of Ji. The state uj is called the 
Choi-Jamiolkowski state of £. The inverse operation 0,^^ maps Choi- 
Janiiolkowski states lo S £(!K' (8) ^) to super-operators 

£ : £(:K) ^ £(:K'), X^tr„(L^^(V(»X^)) 

where the transpose is taken with regards to the basis {|ei)}. 

There are various relations between properties of super-operators and 
properties of the corresponding Choi-Jamiolkowski states. The following 
can be verified by inspection. 

• £ is completely positive <;=^ uo^ is positive semi-definite. 

• £ is a trace-preserving <^=^ tr5j:'(cj^) = 1^. 

• £ is unital <;=^ tj:^{uj^) = Ij^'. 

Kraus Operators and Stinespring Dilation 

The following two lemmas are of crucial importance in quantum informa- 
tion theory. They describe two alternative representations of completely 
positive^ maps, especially trace non-increasing and trace preserving maps. 
Every completely positive super-operator can be represented as a sum 
of conjugations of the input with Kraus operators. [HK69, HK70]. 

Lemma 2.3 (Kraus Representation) . A super- operator £. G C(C{'K),C{'K')) 
is completely positive if and only if there exists a finite set of linear oper- 
ators {Ek}, Ek G £(;>{, ;K') such that 

£ [^] = ^ EkA eI for all A e C{'K) . 

k 
Furthermore, a completely positive £ is trace non-increasing if and only if 
^^ Ej^Ek < 1 and it is trace preserving if and only if ^^ ^k^k = 1- 

The operators {E^} are called Kraus operators. Note that the adjoint 
£T of £ is completely positive and has Kraus operators {EJ^ since 

iT{S.^[B]A) = iT{Bt[A])=ii{y2ElBEkA\ for ah yl G /:(:K) . 

k 

Moreover, every CPM can be decomposed into its Stinespring dilation 
form [Sti55] as follows. 

Lemma 2.4 (Stinespring Dilation). A super- operator E, G £(£(!H), £(IK')) 
is completely positive if and only if there exists a Hilbert space Ji" and an 
operator L G Ci^i, "K' ® "K") such that 

£(^) = tr„„(LALt) foraU AeC{'K). 

Moreover, if £ is trace preserving then L is an isometry. If £ is trace 
non-increasing, then L is an isometry followed by a projection in V{'K"). 



^An extension of this to arbitrary super-operators is possible (see, e.g. [Wat08]). 
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Proofs of these lemmas can be found in most quantum information 
textbooks (see, e.g. [NCOO]). We will only prove the result for trace non- 
increasing CPMs, which is a bit less standard. 

Proof. Let £ be a trace non-increasing CPM. First, note that its Kraus 
representation has to satisfy, for all A £ VCK), 

tr(£(yl)) = tr (j^EkAEl) = tr (^4^^^) < tr(.4) . 

k k 

This is equivalent to the condition Ylk=i ^k-^k ^ 1- Hence, the map can 
be extended to a trace preserving CPM by adding another Kraus operator 
into the sum, e.g. 



En+i = (l-Y,EiE^ 



k=l 

We now construct a Stinespring dilation for this extended operation. In 
fact, a possible dilation is L = "^2=1 ^k^l^k), where {|efc)} is an orthonor- 
mal basis of "K" . This operation is an isometry and a Stinespring dilation of 
£ can be recovered by projecting onto Y12=i \^k){ek\ after applying L. D 

2.2 Quantum Mechanics 

In this thesis, we will use a mathematical model for quantum mechan- 
ics — the density operator formalism on finite-dimensional Hilbert spaces — 
that is restricted to physical systems with a finite dimensional configura- 
tion space. This means that continuous observables such as position and 
momentum need to be considered discretized and bounded. While it is 
unknown whether such a framework is sufficient to describe all possible 
correlations between observations of physical systems [SW08]^, it offers 
the opportunity to focus on the main physical concepts without getting 
lost in delicate mathematical arguments. Moreover, the main results of 
this thesis are independent of the actual dimension of the physical systems 
under consideration and some of the results have already been re-derived 
in a more general model of quantum mechanics that is based on infinite 
dimensional Hilbert spaces [FAbRll] or (von Neumann) operator algebras 
of observables [BFSll]. 

This section is partly inspired by Nielsen and Chuang [NCOO] as well 
as Hardy [HarOl] and introduces quantum mechanics from a strictly infor- 
mation theoretic perspective.^ The most important deviations from the 
standard treatment of quantum mechanics are pointed out. 



^More precisely, it is unclear whether all correlations can be approximated to arbi- 
trary precision using the density operator formalism. 

^Quantum mechanics is often formulated in a way that highlights its relation to 
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2.2.1 Systems and States 

We use a very abstract notion of physical systems, describing them as gen- 
eral purpose information carriers without specifying their actual physical 
realization. In fact, the only system parameter we consider is its dimen- 
sion, which, as we will see, corresponds to the dimension of the Hilbert 
space used to describe the system. 

The dimension of an isolated system is given by the number of mutu- 
ally perfectly distinguishable states that can be prepared on the system. 
This assumes perfect (idealized) preparation and measurement equipment. 
Alternatively, the dimension can be interpreted as the information storage 
capacity of the system. Very abstractly, an encoder is a map £ from a set 
X to states of the system and a decoder 2) is a map from states of the sys- 
tem to Af. A pair {£, D} of encoder and decoder is perfect if x = D[£[x]] 
for all X G Af. The dimension of a system is then given by the maximum 
cardinality of a set X such that there exist perfect encoders and decoders 
between X and states of the system. 

The simplest such system, a qubit or two-level system, has dimension 
two and may have different physical realizations. For example, the infor- 
mation could be encoded as the spin degree of freedom of an electron or 
the polarization of a photon. 

Postulate 1 (State Space). Isolated quantum systems are mod- 
eled as Hilbert spaces with the dimension of the system. The 
system's state is represented by a positive semi- definite opera- 
tor with unit trace acting on this space. 

The state of an isolated quantum system A is thus fully characterized 
by all linear functionals on the state space, i.e. the functionals tr{pALA) 
where L^ G £(^^) and Pa is the state of the system. 

We denote isolated physical systems with capital letters, i.e. A, B, C, 
and their associated Hilbert spaces with 'Ka,'Kb,'Kc. The states them- 
selves are denoted using lowercase greek letters, i.e. Pa, erg, Tc, where the 
subscripts indicate which system is meant when necessary. The dimension 
of a system A is denoted by dA '■= dim{3f^}. 

We use S={'Ka) to denote the set of quantum states on A, i.e. the 
set S={'Ka) '■= {p G VCKa) : tr(p) = 1}. Hence, a state of a system A 
is represented as an element Pa G 5=(^a)- In addition, we sometimes 
employ a larger set of states on ?{^, the sub-normalized states 5<(^a) := 
{p £ VCKa) : < tr(p) < 1}. These states by themselves do not have 
a physical interpretation and usually only appear in technical statements 



classical mechanics. In quantum information science, due to its interdisciplinary na- 
ture between physics, computer science and information theory, a different approach to 
quantum mechanics has proven fruitful. 
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However, they can be seen as normalized quantum states on a Hilbert 
space "Ka' that are projected onto a subspace 'K^ of "K^' ■ 

We cah a positive semi-definite operator p^ G P(?Ca) pure if it has 
rank one, i.e. if rankjp^} = 1. Pure operators can be represented as kets 
\p)a ^ "^A, where \p)^ is determined by the relation p^ = \p){p\a up to a 
phase factor. An operator that is not pure is called mixed. We often employ 
the completely mixed state on a system A, which is given by vr^ := Ia/cIa- 

In most introductions to quantum mechanics — supposably for histori- 
cal reasons — the theory is first formulated in terms of pure quantum states 
and then later extended to arbitrary mixed states. From an information 
theoretic perspective, however, mixed states are more fundamental as they 
are generalizations of classical random variables. 

Classical Registers 

Discrete probability distributions can be conveniently represented as states 
of a quantum system. We call these systems registers and typically denote 
them by the letters X, Y or Z. To a register X and its respective Hilbert 
space JCx, we associate an orthonormal basis {|x)^} where x £ X and X 
is a set with cardinality \X\ = dx- 

A probability distribution on Af is a map Px : Af — )• [0, 1] such that 
^^ Px{x) = 1. It can be represented as a quantum state px on X, i.e. 



Px = ^Pxix)\x){x\ 



It follows from the properties of Px that this state is positive semi-definite 
and has unit trace. In classical probability theory, a register corresponds 
to a discrete random variable and the state of a register to the probability 
distribution over the random variable. In this sense, quantum mechanics 
can be seen as a generalization of classical probability theory. Moreover, 
registers will become important to describe the outcomes of measurements, 
as we will see in the following. 

2.2.2 Separated Systems 

Consider two separate quantum systems, A and B, modeled by a Hilbert 
space "Ka and "Kb, respectively. We can prepare d^ different perfectly dis- 
tinguishable states on system A and d^ different perfectly distinguishable 
states on system B. On the joint system AB, we can thus prepare dA^s 
different perfectly distinguishable states. Hence, according to Postulate 1, 
the dimension of the joint system is d^s = dA^s and it can be modeled 
using the Hilbert space !Kab = ^a (X" J^s of dimension dAdn- Joint states 
of the two systems are then described by normalized positive semi-definite 
operators Pab G S={^ab)- 
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Given a state Pab on the joint system, what are the states of the in- 
dividual systems A and Bl As we have seen before, the state of the 
system A is fully characterized by the linear functionals acting on it, i.e. 
the functionals tr(p^sL^) = tr(/9^s(L^ (g) 1^)) = ^'^{S'^b{Pab)La)- Here, we 
introduced the notation tr^ = tr^^ for the partial trace over the subsystem 
B. Since the state on A is thus fully characterized by linear functionals 
on the operator trfl(/>AB), we define the marginal state or marginal of Pab 
on A as Pa '■= ti s^Pab)- Similarly, we define the marginal of Pab on B as 
Pb '■= ti AiPAB)- In the following, the marginals are introduced implicitly 
with the joint state. ^ 

Entanglement 

We call a state Pab of a joint quantum system AB separable if it can be 
written in the form 

p^g = ^ a^ ^ r^ , where a^ G V{JiA) and r^ G P(J{s) . (2.12) 
k 

Otherwise, it is entangled. The occurrence of entangled states is one of the 
most intriguing properties of the formalism of quantum mechanics. 

The prime example of an entangled state is the maximally entangled 
state 

Ua 

I 

where {|ej)} is an orthonormal basis of 'Ka — ^a'- This state cannot be 
written in the form (2.12) as the following argument, due to Peres and 
Horodecki [Pcr96, HHH96], shows. Consider the super-operator T : p^' ^ 
/9^,, where the transpose is taken with regards to the basis {|ej)} of j4'. 
This super-operator is called the partial transpose (on A') and it is positive 
but not completely positive. Applied to separable states (2.12), it always 
results in positive semi-definite states X^fc '^a ^ (''"s) • Applied to ^aa', 
however, we get 

UA . . eiA . . 

This state is not positive semi-definite. For example, we have 

2 
(c/?|'J[7^^']|(/9) = --— , where |v?) = |ei) (g) |e2) - |e2) (gi |ei) . 
dA 

Generally, we have seen that a bipartite state is separable only if it remains 
positive semi-definite under the partial transpose. The converse is not true 
in general. 



^Whenever we introduce a state (e.g. pabc) on a joint system, we also implicitly 
introduce all its marginals (e.g. pab = trc(pAsc) and pc = trAs(pAsc))- 
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Classical-Quantum Systems 

Joint systems where one (or more) subsystems are classical registers are of 
particular importance. Consider, for example, the case where a classical 
register X is described jointly with a quantum system A. The possible 
joint states pxA can be written as^ 

PxA = J^Px(x)|x)(xU®rf, where r^ G 5=(:Ks) (2.13) 



and Px is a probability distribution on X . These states are called classical- 
quantum (CQ) states. They are of the form (2.12) and, thus, separable. 

A special case occurs when the two classical registers, X and Y , are 
considered jointly. In this case, the states pxv are of the form 



PxY = ^ Pxy{x, y) \x){x\x ® |y)(y|y 
= ^P^(x)|x)(xU^^P^(j/) 



Yi 



where Pxy is a probability distribution on X ^y and, for each x, Pyiv) = 
PxY{x-,y)/Px{x) is the conditional probability distribution on y given a 
fixed X ^ X . Such states allow the description of arbitrarily correlated 
classical random variables. 

Purifications and Extensions 

For any state Pa of a system A^ we can find a purification on an auxiliary 
system A' with 'Ji^ — ^a'- A purification is a pure state \p) E ?C^^' of the 
joint system AA' with the property that Pa = tr^' (/?aa')- More specifically, 
if Pa = X^j Aj |ej)(ej| A is the eigenvalue decomposition of Pa, then a possible 
purification is given by Ip)^^/ = Y2i V^i\^i)A ® ki)^'- Purifications of Pa 
are separable if and only if p^ is pure. More generally, we call a (not 
necessarily pure) state Paa' that satisfies tvA'ipAA') = Pa an extension of 
the state Pa- 

Purifications of CQ states of the form (2.13) can be constructed as 

where |r^) is a purification of r^ on AA' . We call the two registers X and 
A' coherent classical. In particular, the marginal states Pxab and Px'ab 
are classical on A and A', respectively, and they are equal with regards to 
the isomorphism Ix)-,^ i— )• \x)^,. 



^This describes the full set of states that have a classical marginal on X. 
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2.2.3 Evolutions and Measurements 

Evolution 

The evolution of a separate quantum system is most generally described 
by a quantum channel. A quantum channel is a linear map (i.e. a super- 
operator) from quantum states on a system A to quantum states on a 
system B. Since such super-operator map quantum states onto quantum 
states, they must necessarily be positive and trace-preserving. Moreover, 
since quantum channels describe the evolution of quantum systems that 
may be part of a larger joint system, they are required to map positive 
semi-definite states of any joint system to positive semi-definite states. 
This implies that they are completely positive and, thus, TP-CPMs. 

Postulate 2 (Evolution). The evolution of quantum systems is described 
by trace-preserving completely positive maps. 

An important example of such a map is a time evolution. Here, system 
A is any system at time to and system B the same system at a later time 
ti > to- In the traditional treatment of quantum mechanics, the time 
evolution of a system is described by a unitary evolution that is induced 
by the Hamiltonian of the system. A unitary evolution is a special case 
of a TP-CPM and describes the evolution of a closed system, i.e. a system 
that does not interact with any other system. 

Measurement 

A measurement of a quantum systems can be described in the above frame- 
work of general evolutions. 

Postulate 3 (Measurement). A quantum measurement is a trace-preser- 
ving completely positive map from a system to a classical register contain- 
ing the measurement outcome and a system that contains the state of the 
system after measurement. 

Let A be a quantum system, X the classical register containing the 
measurement result and A' the system modeling the quantum system after 
measurement. The corresponding measurement map, M £ T{^a,'^xa'), 
with "Ka' — ^A) has a Kraus decomposition 

M:p^^EkpEl where Ek G C{J{a, ^xa' ) and ^ ^^fe = ^a- 
k k 

Since the resulting state is required to be classical on the register X, we 
further know that 

M[pa] = y Px{x) \x){x\ rf, where r^, G S=iJiA')- (2.14) 
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Here, Px{x) is the probability that the outcome "x" is measured and t^, is 
the state of the system conditioned on the event that the outcome "x" has 
been measured. Due to (2.14), the Kraus operators E^ necessarily have 
the form E^ = \xi^) (g) F^, where -F^ E C^Kji, ^a') — -^I^a) and, thus, 

^-' = ^) E ^''P-^l ^^d Px(^)=tr( ^ fIf.p 

The second equahty foUows from the fact that t^, is normahzed. 

Hence, a measurement is fuUy specified by the operators {F^} and 
the partitioning {k : Xk = x}. Note that this viewpoint is consistent; 
specificahy, Px is a probabihty distribution since 

0< Y. PkFk<tA and Y. E 4Fk = Y^lP>' = ^^- 

Often we are not interested in the specific state after measurement but 
only the probability distribution the measurement induces on the register 
X. In this case, the measurement is fully characterized by the operators 
Mx = X^fci' =x-^kFk^ which define the probability Px{x) = ti{Mxp). The 
set {Mx} is called a positive operator-valued measurement (POVM). 

Definition 2.3 (POVM). A positive operator-valued measurement on a 
quantum system A is a set {Mx} with Mx G V{'Ka) and Y^x-^^ ~ ■^a- 
The corresponding measurement TP-CPM M : "K^ — >• ^x is given by p >-^ 
^^ tr^' (Mj,. /o) \x){x\ and the operators Mx are called POVM elements. 

We often use the following Stinespring dilation of POVM measure- 
ments. The isometry U G U{'Ka,'^a'xx') maps the state to the classi- 
cal register X containing the measurement outcome, a coherent classical 
copy of it, X' , and a possible post-measurement state on A' . The post- 
measurement states that result from a POVM are not unique; however, 
conventionally, one takes Xk = k where k is chosen from the same set as x 
and Fk = \/Mx. Thus, 

M.\p\=\.Yx'A'^p'U'^) where C/ = 

A special case of a POVM occurs when the POVM elements are projec- 
tors, i.e. Mx = M^. We call such a measurement a projective measurement. 
If these projections are of rank 1, e.g. Mx = \x){x\x, the corresponding mea- 
surement TP-CPM takes on the simple form M. : p^^^ Y1,x^^\Pa\^) I^X^U- 

Comparing this to the traditional treatment of quantum measurements, 
we note that it is not necessary to introduce a separate formalism for 
measurements and that we can treat measurements as a special case of 
an evolution. This is possible since we always consider the outcome of 
a measurement as a random variable that is correlated with the state of 
the system after measurement and do not condition the resulting quantum 
state on a particular measurement outcome. 
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2.3 Mathematical Toolkit 

This section covers the most important mathematical tools used through- 
out this thesis. The (smooth) min- and max-entropy introduced in Chap- 
ter 4 can be formulated as semi-definite programs. Operator monotone 
functions are used to explore properties of another class of entropies, gen- 
eralizations of Renyi entropies, in Chapter 6. 

2.3.1 Semi-Definite Programs 

This overview is based on the Watrous Lecture Notes [Wat08] and all proofs 
can be found there. A semi- definite program (SDP) is a triple {A,B,^}, 
where A G ni^i), B G 'H{'K') and * G C[C{^),C{^')) is a super-operator 
from "K to 'Ji' that preserves Hermiticity. The following two optimization 
problems are associated with the semi-definite program. 

primal problem dual problem 



minimize: {A,X) maximize: {B,Y) 

subject to : ^[X] > B subject to : ^^[Y] < A 

X G v{:k) y g v{:k') 

We call an operator X G V^K) primal feasible if it satisfies ^[X] > B. 
Similarly, we say that Y G V{!K') is dual feasible if ^Tjy] < A. Moreover, 
we denote the optimal solution of the primal problem with a and the 
optimal solution of the dual problem with f3. Formally, 

a:=mi{{A,X) :XgP(^),^[X] > B} 

p:= sup {{B,Y) :Y GV{Ji'),'^^[Y] <A}. (2.15) 

The following two theorems provide a relation between the primal and 
dual problem of an SDP. 

Theorem 2.5 (Weak Duality). Let {A, B, ^} be a SDP and a, /3 defined 
as in (2.15). Then, a > /?. 

This implies that every dual feasible operator Y provides a lower bound 
of {B, Y) on a and every primal feasible operator X provides an upper 
bound of {A, X) on j3. 

Theorem 2.6 (Strong Duality). Let {A, B, ^} be a SDP and a, 13 defined 
as in (2.15). Then the following holds: 

• If a is finite and there exists an operator Y > such that ^"^[y] < A, 
then a = f3 and there exists a primal feasible X such that {A, X) = a. 

• If P is finite and there exists an operator X > such that ^[X] > B, 
then a = (3 and there exists a dual feasible Y such that {B, Y) = (3. 
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Optimization problems that can be formulated as semi-definite pro- 
grams can be efficiently solved numerically.^ 

2.3.2 Operator Monotone Functions 

Here, we discuss some useful properties of operator monotone, concave 
and convex functions. This section is largely based on Chapter V of Bha- 
tia [Bha97] and we simply repeat the results here in the form we need in 
later chapters. 

Operator monotone functions preserve the partial order on operators 
induced by '>' and are, thus, necessarily monotone. 

Definition 2.4. Let i^ QM. A function / : — t- M is operator monotone 
on Q if A > B implies f{A) > f{B) for any Hermitian operators A,B 
with eigenvalues in 0,. 

Similarly, operator concave and convex functions generalize the concept 
of concavity and convexity to operators. 

Definition 2.5. Let fi be an interval on R. A function / : — )• M is 
operator concave on Q if f{fJ.A + (1 — IJ-)B)^ > fJ-f{A) + (1 — fi)f{B) for 
all Hermitian operators A, B with eigenvalues in Q and all fi £ [0, 1]. The 
function f is operator convex on 0, if —f is operator concave on il. 

Prominent examples of such functions include (cf. Chapter V in [Bha97]) 

• The logarithm function, which is operator monotone on M^. 

• The function h : t >-^ —t log t with its extension to /i(0) = lim^^o h{t) = 
0, which is operator concave on M.q . 

• The family of functions ga '■ t h^ f^. These functions are operator 
concave and operator monotone on M^ for a G (0, 1] and operator 
convex on M.^ for a G [1,2]. (Note that these functions are convex 
but not operator convex if a > 2.) 

We start with a straightforward application of Jensen's inequality: 

Lemma 2.7. Let $7 be an interval on M and let f : Q ^ M be concave on 
r2. Then, for any \ip) €z Ji with \\ip\\ = 1 and A G T-L{'K) with eigenvalues 
in VL, we have {ip\f{A)\ip) < f {{ip\A\ip)) . 

Proof. Using the eigenvalue decomposition A = ^^ Ai|ej)(ei|, we get 

i i 

where we used that ^^ |(c/?|ej)p = 1. D 



^For example, using the SeDuMi solver [SED] and YALMIP front-end [L6f04]. 
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A comprehensive generalization of Jensen's inequality to operator con- 
vex functions is given in [HP03]. We state a specialized version here for 
completeness. 

Lemma 2.8 (Operator Jensen's Inequality). Let Q be an interval on M 
and let f : Q ^ M be continuous and operator concave on Q. Then, for 
any isometry U : "K ^ Ji' and A £ TiCX) with eigenvalues in O, we have 



Uf{A)U^<f{UAU^) 
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Chapter 3 

The Purified Distance 



This chapter is based on [TCRIO], where the purified distance was first 
proposed as a metric on the space of sub-normahzed quantum states. The 
usefulness of the purified distance will become apparent when it is applied 
to define the smooth min- and max-entropies in Chapter 4, providing them 
with natural properties such as invariance under local isometrics and var- 
ious data processing inequalities. 

3.1 Introduction and Related Work 

Smooth entropies, evaluated for a quantum state p, are defined indirectly 
via an optimization (either a maximization or a minimization) of an under- 
lying unsmoothed entropy over a set of states that are e-close to p, where 
e is a small smoothing parameter. The resulting quantities are called e- 
smooth entropies. (See, for example, Chapter 4, where the e-smooth min- 
and max-entropies are defined in this way.) 

Consequently, various definitions of such sets of close states — subse- 
quently called e-halls — have appeared in the literature. However, to the 
best of our knowledge, none of the existing definitions simultaneously ex- 
hibit the following two properties that are of particular importance in the 
context of smooth entropies in the quantum regime. 

• Firstly, the definition of the e-smooth entropies should be indepen- 
dent of the Hilbert spaces used to represent the quantum state p. 

In particular, embedding p into a larger Hilbert space prior to smooth- 
ing should leave the e-smooth entropies unchanged. Note that, in 
general, embedding p into a larger Hilbert space offers more flexi- 
bility for smoothing as more dimensions orthogonal to the support 
of p become available for the optimization. Indeed, for some e-balls 
that contain only normalized states, smoothing outside the support 
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becomes advantageous and the smooth entropy thus depends on the 
Hilbert space representation of p. 

We can avoid this problem by including sub-normalized quantum 
states in the e-balls. 

Secondly, it will be important that we can define a ball of pure states 
that contains purifications of all the states in a given e-ball. This will 
allow us to establish duality relations between smooth entropies and 
is achieved by using a fidelity-based metric to determine e-closeness. 



3.1.1 Main Contributions 

The following sections introduce a new metric on sub-normalized quantum 
states, the purified distance. We call two quantum states p and r e-close, 
denoted p w^ r, if and only if the purified distance between them is at 
most e. The purified distance has various interesting properties, among 
them are the following. 

Result 1 (Purified Distance). The purified distance is a metric on sub- 
normalized states and has the following properties: 

• If at least one of the states is normalized, it can be expressed in terms 
of the fidelity as P{p, t) = y/l — F{p,tY . 

• It is an upper bound to the trace distance. 

• For any trace non-increasing CPM £ and any states p and a, we 
have p^e T =^ 8.[p] »e £[r]. 

• If Pab is a state and a a ~e Pa is close to its marginal, then there 
always exists an extension Uab of a a with Uab ~e Pab ■ 



3.1.2 Outline 

Section 3.2 introduces two metrics on the set of sub-normalized states, the 
generalized trace distance and the purified distance. In Section 3.3, we 
discuss various properties of the purified distance, including its relation to 
the generalized trace distance, its monotonicity under trace non-increasing 
CPMs and an adaption of Uhlmann's theorem to the purified distance. 
Section 3.4 then explains some notational conventions, made possible by 
the use of the purified distance as a metric, that should help the reader 
through the remaining chapters. 
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3.2 Two Metrics for Quantum States 

For the remainder of this chapter, let "K be an arbitrary finite-dimensional 
Hilbert space. ^ The two most common measures of distance between nor- 
malized quantum states are the trace distance and the fidelity. 

3.2.1 Generalized Trace Distance 

We start by introducing a straight-forward generalization of the trace dis- 
tance to sub-normalized quantum states. 

Definition 3.1. For p,T £ 5<(!K), we define the generalized trace dis- 
tance between p and r as 

D{p, t) := max { tr{p - t}+, tr{r - p}+} . 

The generalized trace distance can be expressed alternatively in terms 
of the Schatten 1-norm as 

D{p,t) = -\\p-t\\i + -|tr/9-trr| 

and it is easy to verify that it is a metric on £(IK). In the case where 
both p and r are normalized states, we recover the usual definition of the 
trace distance, D{p,t) := ^\\p — r||i. Furthermore, the trace distance has 
a physical interpretation as the distinguishing advantage between the two 
states. In other words, the probability Pdist(P) ''") of correctly distinguishing 
between two equiprobable states p and r by any measurement is upper 
bounded by [NCOO] 

Pdist{p,r)<^{l + D{p,T)). (3.1) 

3.2.2 Generalized Fidelity 

On the other hand, various metrics are derived from the fidelity, which is 
given as F{p,t) = \\^/p\/t\\i for two normalized states p and r. We will 
not use the letter F to denote the fidelity hereafter, instead reserving it 
for the generalized fidelity defined below. (We will also see that the two 
quantities agree if at least one state is normalized.) The fidelity has many 
interesting properties, some of which we will list here for further reference. 
The most important properties of the fidelity are summarized in Table 3.1. 
Here, we propose a generalization of the fidelity to sub-normalized 
states. The generalization is motivated by the observation that sub-nor- 
malized states can be thought of as normalized states on a larger space 



^Note also that most results of this chapter have recently been generalized to the 
framework of general von Neumann algebras [BFSll]. 
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The following holds for arbitrary positive operators p,T,a G V{'K). 
i. The fidelity is symmetric in its arguments, iy^-v/rlli = llv^-v//'lli- 
ii. The fidelity is monotonously increasing under the application of 
TP-CPMs (cf. e.g. [NCOO], Theorem 9.6). This implies that, for 
any TP-CPM £, we have 



/trpvtrr > 



^/m^/W] , > WVpV^I ■ (3.2) 



1 "1 



To get the first inequality, we used that tr is a TP-CPM. 
iii. Uhlm,ann's theorem [Uhl85] states that, for any purification (/? of p. 



||V^/F||i=rnax|(v.|^)|, (3.3) 

where the maximum is taken over all purifications i? of r. 
iv. For any projector II G 'P(^), we have 



||v%nV^||^ = ||yii^\/iMi||^ = ||VpVimi||^. (3.4) 



V. For any a > p, we have ||\/o"V^lli ^ iv^V^ii- 

vi. For states p = pi (B P2 and r = ri © T2, where pi,Ti £ VCXi) and 
P2,T2 € V{'K2), we have 



VpVt^Wi = ||\/pi\/n||i + ||\/p2\/7^||i 



Table 3.1: Properties of the Fidelity. 

which are projected onto a subspace. Hence, we define the generalized 
fidelity as the supremum of the fidelity between such normalized states. 

Definition 3.2. For p,T E 5<(!K), we define the generalized fidelity be- 
tween p and T as 

F{p,t) := sup sup VPVf , (3.5) 

where the supremum is taken over all embeddings V of "K into "K' and all 
states p, t € S={'K'), such that p and r are images of p and f under V'. 
(Namely, the states satisfy V^pV = p and V^fV = t.) 

This expression reduces to the fidelity when at least one state is nor- 
malized. To see this, consider the following alternative expression for the 
generalized fidelity. 
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(3.6) 



Lemma 3.1. Let p,T € 5<(!K). Then, 

Fip,r) = F{p,T) = W^pV^W^ + V(l-tr/5)(l-trT) 

where p := p(B (l—tip) and f := r © (1— trr). 

Proof. Let U : 'K ^ 'K' , p and r be an arbitrary candidate for the supre- 
mum in (3.5). Moreover, let £ be the pinching p i— )■ npll + n^pll^, where 
n := UU"^ projects onto the image of U and n-"- := 1 — 11 is its orthogonal 
complement on ^i' . Then, due to the monotonicity property (3.2), we find 



y/pVr 



< 






+ 



y/U^pU^^/U^pW 



tr p)(l — trr) 



In particular, the r.h.s. is an upper bound on F{p,t). Finally, it is easy 
to verify that this upper bound is achieved with the choice "K' = !K(BC as 
well as p and f. D 



3.2.3 Purified Distance 

Next, we define a distance measure based on the fidelity, analogously to 
the one proposed in [GLN04, RasOG]""^. 

Definition 3.3 (Purified Distance). For p,T £ 5<(^), we define the pu- 
rified distance between p and r as 

P{p,T):=^l-Fip,Ty. 

Other distance measures based on the fidelity have been investigated 
in the literature. In particular, the Bures metric [Bur69], B{p,t)'^ := 
2(1 — F{p,t)), and the angular distance, A{p,t) := arccosF(p,r) [NCOO]. 
We prefer the purified distance because it constitutes an upper bound on 
the trace distance as we will see below and thus inherits its operational 
interpretation as an upper bound on the distinguishing advantage. 

The name "Purified Distance" is motivated by the fact that, for nor- 
malized states p,T £ S={'K), we can write P{p,t) as the minimum trace 
distance between purifications \(p) oi p and \'d) of r. More precisely, using 
Uhlmann's theorem (3.3), we have 



P{p,t) = ^1-F{p,tY 



max|((/3|'i?)p 



min y 1 — K^^l"!?)!-^ = minL'((/3, ??) 



^The quantity C{p, r) — yjl — F^{p,t) is introduced in [GLN04], where the authors 
also show that it is a metric on 5=(3f). In [Ras06], the same quantity is called sine 
distance and some of its properties are explored. 
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The purified distance is a metric on the set of sub-normahzed states 
according to Definition 2.1. 

Proposition 3.2. The purified distance P{-,-) is a metric on 5<(!K). 

Proof. Let p,T,a G 5< (IK). The condition P{p, r) = <;=^ p = t can be 
verified by inspection, and symmetry P{p, r) = P{t, p) follows from the 
symmetry of the fidelity. 

It remains to show the triangle inequality, P{p, r) < P{p, a) + P{a, r). 
Using Lemma 3.1, the generalized fidelities between p, r and a can be 
expressed as fidelities between the corresponding extensions /5, f and a. 
We employ the triangle inequality of the angular distance, which can be 
expressed in terms of the purified distance as A{p,t) = aiccos F{p,t) = 
axcsin P{p,t).^ This leads to 

P{p,t) =smA{p,f) 

< sin {A{p, a) +A{a,f)) 

= sin A{p, a) cos A{a, f) + sin A{a, f) cos A{p, a) (3.7) 

= Pip,a)F{a,T) + Pia,r)F{p,a) (3.8) 

<P{p,a) + P{a,T), 

where we employed the trigonometric addition formula to get (3.7). D 

Note that the purified distance is not an intrinsic metric, i.e. given two 
states p, T with P{p, r) < e it is in general not possible to find intermediate 
states a^ with P{p,a^) = Xe and P{a'^,T) = (1 — X}e. In this sense, the 
above triangle inequality is not tight. It is thus sometimes useful to employ 
Eq. (3.8) instead. For example, given three states p,T,a £ 5<(5{) and 
< e, e < 1, we find that P{p, cr) < e and P(o", r) < e implies 

Pip, t) < eVl -£^ + e\/\ - £2 (3.9) 

if arcsine + arcsine < |. This bound is plotted in Figure 3.1. 

3.3 Properties of the Purified Distance 

The purified distance allows simple upper and lower bounds in terms of 
the generalized trace distance. 

Proposition 3.3. Let p,T £ 5<(!K). Then 

D{p,T) < P{p,T) < ^2D{p,T) - D{p,tY < x/2Z)(p,r). 



^A proof that yl is a metric for normalized states is given in [NCOO]. 
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Figure 3.1: Improved Triangle Inequality. Contour plot of the expression 
in Eq. (3.9) with axes e and e. The boundary contains all pairs {e, e} that 
lead to the trivial bound P{p,t) < 1. This shows that Eq. (3.8) gives 
significantly tighter results than the triangle inequality for the purified 
distance if e and e get large. For example, if P{p,a) = P{(t,t) = 0.5, we 
find P(p, r) < 0.87 instead of the trivial P{p, r) < 1. 

Proof. We express the quantities using the normalized extensions p and f 
of Lemma 3.1 to get 



P{p,T) = Vl - F{p,fY > D{p,f) = D{p,T) and 
P{p,Tf = 1 - F{p,ff < 1 - (1 - D{p,f)f 
= 2D{p,T)-D{p,Tf<2D{p,T), 

where we employed the Fuchs - van de Graaf inequalities 1 — F{p, f) < 
D{p,f) < y^l-V(p,f)2 (cf. [Fuc96] and [NCOO], Section 9.2.3). D 

One very useful property of the purified distance is that it does not 
increase under simultaneous application of a quantum operation on both 
states. We consider the class of trace non-increasing CPMs, which includes 
TP-CPMs, projections and partial isometrics. 

Theorem 3.4 (Monotonicity of Purified Distance). Let p,T ^ 5<(IK) and 
8, : "K ^- J{' a trace non-increasing CPM. Then, 

F(8[p],£[r])>F(p,r) and P(£[/;], £[r]) < P(p,r). 

Proof. Remember that a trace non-increasing CPM £ : !K — )• IK' can be 
decomposed into an isometry [/ : IK — )■ IK' (8> IK" followed by a projection 
n G 'P{'K' (Si IK") and a partial trace over IK" (cf. Lemma 2.4). Isometries 
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and the partial trace are TP-CPMs and, hence, it suffices to show that 
F{p,t) < F{e[p],E[T]) for TP-CPMs and projections. 

First, let £ be a TP-CPM. Using (3.6) and the monotonicity under 
TP-CPMs of the fidelity (3.2), we see that 



F{p,r) = IIVpV^IIi + V(l-trp)(l-trT) 

< |y£Ry£M||^ + \/(l - trp)(l - trr) 
= F{e.[p],E[T]). 

Next, consider a projection UeVCK) and the CPM E. : p ^ UpU. Fol- 
lowing the definition of the generalized fidelity in (3.5), we write F(p, r) = 
sup k/pV^ 1) where the supremum is taken over all normalized exten- 
sions {5{',p, r} of {^, p, r}. Since all normalized extensions of {Ji,p,T} 
are obviously also normalized extensions of {supp {11}, 11/911, nrnj , we find 

F(npn,nrn) >f{p,t). 

Finally, the second statement trivially follows from the first one by 
definition of the purified distance. D 

The main advantage of the purified distance (and other metrics based 
on the fidelity) over the trace distance is that we can always find extensions 
and purifications without increasing the distance. This is captured in the 
following two results. 

Theorem 3.5 (Uhlmann's Theorem for Purified Distance). Let p,T £ 
S<{'K), 'K' = 'K and \ip) £ Ji (^ Ji' be a purification of p. Then, there 
exists a purification |i?) £'K(S>'K' of t that satisifies P{p,t) = P{ip,'d). 

Proof. We use Uhlmann's theorem for the fidelity (3.3) to choose |t?) G 
'K^'K' such that ||\/P\/^||i = Kvl"*^)! holds. Then, due to (3.6), we have 
F(p, r) = F{ip, 1?) as well as P{p, r) = P{ip, '&). D 

Corollary 3.6. Let p,T £ 5<(!K) and p G S<{'K(S>'K') be an extension of p. 
Then, there exists an extension f £ S<{'K0'K') ofr with P{p, r) = P{p, f). 

Proof Let Ji" = 'K'^^K' he an auxihary Hilbert space and ip £ Jii^Ji' i^^K" 
be a purification of p. We introduce a purification i? £ IK (g) !K' (gi "K" 
of r with P{ip, I?) = P{p, t) using Uhlmann's theorem for the purified 
distance above and define r = tr^^;" (-(?). However, due to monotonicity (cf. 
Theorem 3.4), we have P{(p, t?) > P{p, f) > P{p, t), which implies that all 
three quantities must be equal. D 

The next lemma offers an explicit construction that satisfies Corol- 
lary 3.6. This was shown in [DBWRIO] and we provide the proof here for 
completeness. 
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Lemma 3.7. Let p,T £ S<{'K) and p G 5<(!K ® "K') an extensions of 
p. Then, there exists an operator X G C{'K) such that f = XpX'^ G 
5<(;K(g)^') is an extension of t with P{p,t) = P{p,f). 

Proof We show the theorem for the case when p is pure. The general 
statement then follows from the same arguments outlined in the proof of 
Corollary 3.6. Let U\y/p^/T\ be the polar decomposition of y/p\/T- Then, 
we define X := T^''^Up~^'^ using the generalized inverse. Clearly, r = 
XpX' is an extension of r as tv^'if) = XpX' = r. Furthermore, we find 

||y;^V^||i = \{p\f)\ = \{p\X\p)\ = \tr{Xp)\ = \tr{U^V^)\ = \\VpH\r 

The equality of the purified distance then follows by Lemma 3.1 and the 
definition of the purified distance. D 

The following lemma (see also [BCC^IO, TRll]) gives a bound on the 
distance between a state and the projection of that state onto a subspace. 

Lemma 3.8. Let p G 5<(!K) and 11 G VCK) be a projector, then 

P{p,UpU) < ^2 tr(n-Lp) - tr(n-L/))2 , 
where H^ := 1 — 11 is the complement of II on IK. 

Proof The generalized fidelity between the two states can be bounded 
using tr(np) < ti{p) and L/pVUpn = tr(np), which follows from (3.4). 
We have 

F{p, UpU) > tT{Up) + 1 - trip) = 1 - tr{U^p) . 

The desired bound on the purified distance follows from its definition. D 

3.4 Notational Conventions 

We will often use the notation /? ~e r (in words, "p is e-close to r") to say 
that P{p,t) < E, where e <C 1 is a small parameter and p,T £ 5<(IK). 

The following intuitive notational conventions will be used widely in 
the following chapters and shortens many proofs. Let ^ab be a bipar- 
tite Hilbert space. If Pab £ <S<{^ab) and Pa G <S<{^a) are defined, we 
implicitly define Pab as the extension of Pa that has minimum purified 
distance to Pab according to Corollary 3.6 and Lemma 3.7. This implies 
that P{pab,Pab) = P{pa,Pb)- 

Generally, states with the same greek letter will be treated in this way, 
where the distance of the modified letter states (e.g. p, a, f) is always mea- 
sured with regards to the unmodified letter state (e.g. p,a,T). Consistent 
with that, if an extension is introduced that is not of minimum distance, 
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we will always use another greek letter to denote it. Due to the argu- 
ments above, it is usually sufficient to write P{p., p) without mentioning 
the subspaces on which the states are compared. 

These minimum distance extensions often inherit properties of the orig- 
inal state. For the example discussed above, if p is classical on B then p is 
also classical on B. This follows directly from the monotonicity (cf. The- 
orem 3.4) of the purified distance under trace non-increasing maps, since 
P{M{p\^ p) = P{M[p\^Ai[p\) < P{p,p), where M. measures in the classi- 
cal basis of B and thus leaves p invariant. Using the same argument, one 
can show that the minimum distance extension p^^g lives in the subspace 

"Ka (gl SUpp {Pb} of ^AB- 

Furthermore, if £ : JIa — )• ^b is a trace non-increasing map and Tg = 
S.[pa], then, the definition of a state p ^s P ^ilso implicitly defines a state 
f = 8,[pa] ~£ T. This will often be used when £ is an isometry. In 
this case, we often give the states the same letter, for example ii U = 
Xlx \^)x ® I^)x'(^Ia is ^^^ isometry that purifies a projective measurement 
of the system A in the basis {|x)}, we use pxx' = UpaU^ to denote the 
mapped state. 
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Chapter 4 

Min- and Max-Entropies 



This chapter formany introduces the min- and max-entropies for quantum 
states and discusses some of their properties. We provide a plethora of 
different expressions for the min- and max-entropy and introduce the in- 
terpretation of the min-entropy as a guessing probabihty. Moreover, we 
explore the classical limits of the min- and max-entropy and investigate 
their continuity and the concavity of the max-entropy. 

4.1 Introduction and Related Work 

We have seen in the introduction that generalizations of the classical Renyi 
a-entropies [Rcn61] can be used to characterize different information the- 
oretic tasks in the one-shot setting. For a discrete probability distribution 
Px over a set X, the Renyi a-entropies are defined as 

i/„(X)p:=-^logVP^(x)", where a G (0, 1) U (l,oo). 
I — a ^ — ' 



X 



These entropies have a trivial generalization to the quantum setting, which, 
for a state p G S={'Ka), is given as 

/?„(^),:=^logtr(p-). (4.1) 

The range of allowed a can be extended to include and oo by taking the 
respective limits of (4.1). This leads to the expressions 

H^{A)p := -logl^loo and Hq{A)p := log rank {/).4}- 

Furthermore, taking the limit to a = 1 from both sides reveals the von 
Neumann entropy; hence, we set Hi = H and have now defined a spectrum 
of entropies for a £ [0,oo]. These entropies are monotonically decreasing 
in the paraemter a, i.e. 

a > /3 ^ H^{A)p < Hp{A)p for all p G S={'Ka). 
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One of the first questions to answer is now whether we need to con- 
sider the whole spectrum of Renyi a-entropies for our framework for non- 
asymptotic information theory. This was answered to the negative by 
Renner and Wolf [RW04, RW05]. They show that if we allow a small vari- 
ation of the state of the system — in the following called smoothing — these 
entropies can be separated into three classes, the elements of each being 
approximately equal. The three classes are: the von Neumann entropy, 
the a-entropies with a < 1 and the a-entropies with a > 1. 

To follow their argument, we define e-smooth Renyi a-Entropies, 

^ , ., I miuo H,JA)o if a < 1 

HUA)p=\ " ";;; .^ , where 0<e<l 

I maxp H^{A)p it a > 1 

and the optimization in each case is over an e-ball of close states, p ~e p. 
Note that we smooth in the direction of the von Neumann entropy in both 
ranges, a < 1 and a > 1. 

These entropies now satisfy the following inequalities. (The proof of 
these statements can be adapted from results in [RW04, RW05].) 

^V2^'' ^^)^ - iTa ^°S ^ < ^r^' (^)p < m;^ {A), + 2\og^ (a < 1) , 
H^^^\A), + -L-iogl>H^^{A),>H^J,iA), (a>l). (4.2) 

Note that the deviation terms in a and e do not depend on properties 
of the state. Hence, if the entropies are large enough, these terms will be 
negligible in comparison. 

On one hand, the second statement of Eq. (4.2) thus shows that smooth 
Renyi entropies of order a > 1 are well approximated by the smooth Renyi 
entorpy of order oo. We choose the min-entropy, H^^^{A)p = H^(A)p, 
as the representative of his class of Renyi entropies. The choice of the 
Renyi cxD-entropy is motivated by its operational interpretation as a guess- 
ing probability (see [KRS09] and Section 4.4) as well as the fact that its 
quantum generalization has a simple form that can be expressed as a semi- 
definite program. 

On the other hand, the first statement of Eq. (4.2) implies that the 
smooth Renyi entropies of order a < 1 are well approximated by the 
smooth Renyi entropy of order ^. We call this entropy the max-entropy, 
iJjnax(^)p = -^i/2(^)p- "^^^ choice of this entropy as a representative of 
the class may seem arbitrary at this point. Indeed, it could be argued that 
the Renyi entropy of order is also a natural choice, as it characterizes 
such tasks as the amount of memory needed to store the output of a source 
perfectly in the one-shot setting.^ However, the choice of ^ is motivated 



^In fact, the initial extension to the fully quantum setting was done for the Renyi 
entropy of order [Ren05]. 
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by the duality relation of the conditional min- and max-entropies, which 
holds for this choice of the max-entropy (see [KRS09] and Lemma 4.1). 

Conditional Renyi entropies can be defined in various ways. In analogy 
with the von Neumann entropy, we might be tempted to define H^{A\B) = 
H^{AB) —H^{B) . However, to the best of our knowledge, this definition is 
not very useful to characterize information theoretic tasks in the one-shot 
setting. In this chapter, we propose a natural generalization of the min- 
entropy that is motivated by its operational interpretation as a guessing 
probability. Our generalization of the max-entropy then follows immedi- 
ately from the duality of the min- and max-entropies. 

Quantum generalizations of the min- and max-entropies were first con- 
sidered by Renner and Konig [RK05, Ren05] in order to investigate security 
in quantum cryptography and related tasks, e.g. information reconciliation. 
They considered a generalization of the Renyi-entropy of order and two 
variations of the conditional min-entropy as well as a different method 
of smoothing than the one proposed in this thesis. The smooth entropy 
framework has been consolidated since and we attempt to summarize the 
most important results in this and the following chapter. 

4.1.1 Main Results 

The main result of this chapter is a collection of expressions for the quan- 
tum conditional min- and max-entropies. 

Result 2 (Expressions for the Min- and Max-Entropy). Let Pabc be a 
pure quantum state, then 



Hrae.M\C)p = log min { ||Zc ||^ : Zac G V{^ac) A Pabc < Z^c ® Is} 
= log dA max F^ (pac, tta (?) CTc) 

= log dA max max F^ {pab'b" , Iab' ® Ts") 

B -> B'b" t 

= log min { tr(o-) : a £ V{'Kb) A Pab < 1a i^ o-js} 



• 1 II -V2 -1/2 1 

mmlog o-s PabCFb 



oo 
.A. 



= min inf JA G M : p^s < 2 1^ ^c^b] 

where a, r are quantum states and B — t- B'B" is an embedding. 

In particular, this result can be interpreted as follows. The expression 
maxs ^ g'B" max,- F(^Pab'b" , Iab' ® Tb") measures the fidelity with a state 
that corresponds to an omniscient observer B = B'B" of the system A. 
Any such observer necessarily controls a system B' that is fully entan- 
gled with the system A and may, in addition, control a system B" that 
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is uncorrelated with A. Moreover, the expression maxa- F [p ^c , t^ a ^ <^c) 
measures the fidehty with a state that corresponds to an ignorant observer 
of the system A. Such an observer C necessarily holds a state ac that 
is product with the system A. Since these two quantitates are equal, we 
find the following: For any pure state Pabc, the marginal Pab is close to 
an omniscient observer B of the system A if and only if p^c is close to an 
ignorant observer C of system A. 

4.1.2 Outline 

In Section 4.2 we formally introduce the conditional min- and max-entropies 
and show how they can be expressed as semi-definite programs. We also 
formally introduce the von Neumann entropy. In Section 4.3 we evaluate 
the conditional min- and max-entropies for classical probability distribu- 
tions, and in Section 4.4, we explore the interpretation of the min-entropy 
as a guessing probability. Section 4.5 then discusses various properties 
of the min- and max-entropy. Most importantly, we give first bounds on 
the min- and max-entropies and show that these entropies are continuous 
functions of the state. 

4.2 Min- and Max-Entropies 

4.2.1 The Min-Entropy 

The min-entropy of A conditioned on B has already been introduced in 
previous chapters as a measure of closeness to a state describing a perfect 
observer of the A system. We start with alternative definitions of the min- 
and max-entropy and then develop a variety of alternative expressions. 

Definition 4.1. Let p^s G '5<(IK^s). The min-entropy of A conditioned 
on B of the state p^B is 

H^^{A\B)p := max sup {A € M : Pas < 2"^!^ ^ (Tb}, (4.3) 

where the maximum is taken over all states a S 5<(5{b). 

Note that there exist feasible A only if suppjiTs} 5 suppjps}. How- 
ever, if this condition on the support is satisfied, there exists a feasible 
A* = —log II CTs PabO'b II which achieves the supremum. The min- 
entropy can thus alternatively be written as 

Hmmi^\B)p = max -log WcTb^^^PabCTb \\^,, (4.4) 

where we use the generalized inverse and the maximum is taken over all 
(Tb G SkCKb) with supp{cJs} 5 suppIpb}. 
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We can also reformulate (4.3) as a semi-definite program (SDP). (Semi- 
Definite Programs are introduced in Section 2.3.1.) For this purpose, we 
include 2~ in Ub and allow the new Ug to be an arbitrary positive semi- 
definite operator. The min-entropy is then given by 

H^^^{A\B)p = -logmin{tr(o-) : a e V{:Kb) A Pab < t^^aB} 

and the optimization problem thus has an efficient numerical solver. In 
particular, we consider the SDP for the expression 2~ mini^l^ip. 

primal problem dual problem 



minimize: {1b, Cb) maximize: {pab,Xab) 

subject to : 1^ <^ ^s ^ Pab subject to : tr^(X^B) < Ig 

aB G V{:Kb) Xab G Vi^KAB) 

(4.5) 

Clearly, the dual problem has a finite solution; in fact, it is easy to verify 
that {pab,Xab) < tr(X4s) < ds- Furthermore, there exists a CTs > with 
Ia'SkTb > Pab- Hence, strong duality (Theorem 2.6) applies and the primal 
and dual solution are equivalent. 

Let us now investigate the dual problem more closely. We can replace 
the inequality in the condition Xb < 1b by an equality since adding a posi- 
tive part to Xab only increases {pab,Xab)- Hence, Xab can be interpreted 
as a Choi-Jamiolkowski state of a completely positive unital map (cf. Sec- 
tion 2.1.5) from 'Kb' — ^a to "Kb- Let £^ be that map, then 

2-^mi„(^|S)p = max {pab, e.^[TAB']) = cLa max {^[pabIIab') , 

£t ^ ' £ ' ' 

where the second maximization is over all TP-CPMs £. from B to B' , i.e. 
all super-operators whose adjoint is completely positive and unital from B' 
to B. The fully entangled state 7 = F/d^ is pure and normalized, hence, 
we can write [KRS09] 

H^i^{A\B)p = -logdAUia^F'^{8[pAB],7AB'), (4.6) 

where the maximum is taken over all TP-CPMs from B to B' . (Note 
that 7 is defined as the fully entangled in an arbitrary but fixed basis of 
"Ka — Kg'- The expression is invariant under the choice of basis, since 
the fully entangled states can be converted into each other by a unitary 
appended to £.) We write F for the generalized fidelity, which corresponds 
to the fidelity in this case as 7 is normalized. 

Alternatively, we can interpret Xab as the Choi-Jamiolkowski state of 
a TP-CPM map from Ka' — Kb to Ka- This immediately leads to the 
relation 

H^i^{A\B)p = - log dB max {pAB,^b A' b]) , 
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where the maximization is over all TP-CPMs from A' to A. 

We may now decompose the TP-CPMs of (4.6) into their Stinespring 
dilation: an isometry U : "Kg — )• ^b'b" followed by a partial trace over 
^B'- Uhlmann's theorem now implies that there exists an extension of 
Jab' to B" such that F{UpabU\ Jab'b") = F{£,[pab], Jab')- Since such 
extensions of a pure state are necessarily of the form Jab'b" = 7ab' "X" Tb", 
we recover the following expression for the min-entropy 

H^i^{A\B)p = - log (Ia max max F'^Uab'b", Iab' ® Tb"), (4.7) 

B -^ b'b" r 

where the maximization is over all isometrics from B to B'B" and states 
TeS={JiB"). 

Using the expression in (4.7), the min-entropy can be interpreted as a 
measure of distance to a state describing an observer B that is omniscient 
about A. Such an observer must necessarily hold a state 7 that is fully 
entangled with A and may, in addition, hold an arbitrary state r that 
is uncorrelated with A. The min-entropy now evaluates the distance (in 
terms of the fidelity) of p to the closest such state. 

Finally, we introduce the quantity H^:^^{A\B) , which is a trivial lower 
bound on H^^^{A\B) and is sometimes used instead of H^-^j^{A\B) [Ren05]. 

H^,^iA\B), := -log||p,'/V^,p,'/^||^ < H^,^{A\B)p. 

The inequality follows by the choice 0"^ = Pb m (4.4). It has been shown 
that the smooth versions of H^^^{A\B) and H^-^^{A\B) are equivalent up 
to terms in the smoothing paramter [TSSRll]. 

4.2.2 The Max-Entropy 

We use the following definition of the max-entropy. 

Definition 4.2. Let Pab G <S<{^ab)- The max-entropy of A conditioned 
on B of the state Pab is 

H^^^{A\B)p := max log \\^y^I^Vt^^Wo^L, (4.8) 

where the maximum is taken over all states a G 5<(IKb). 

Since the maximum is taken for normalized states a G S={'Kb), we 
may rewrite this as 

H^^^{A\B)p = max log dAF[pAB,T^A ®cfb)- 

Contrasting this to the min-entropy in (4.7), the max-entropy can be seen 
as a measure of proximity of /j to a state describing an observer B that 
is ignorant about A. Such an observer necessarily holds a state that is 
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product with the state on A and the max-entropy evaluates the fidelity 
with the closest such state. 

Introducing an arbitrary purification Pabc of Pab and applying Uhlmann's 
theorem, we rewrite this as the following optimization problem. 



2H^,,(A\B)p ^ ^^inax {pabc\tabc\Pa 



BCl 1 



where r has the marginal Tab = t^a® <^b for some a £ 5<(!Ks). This is 
the dual problem of the following SDP: 

primal problem dual problem 



minimize: p maximize: {Pabci^abc) 

subject to : /il^ > tr^(Z_4B) subject to : trc(yABc) ^ 1a "Xi fs 

Zab fX" Ic > Pabc tr(o-B) < 1 

Zab £ 'P{'Kab) Yabc £ 'P{'Kabc) 

P>0 as G V{^b) 

Again, it is easy to verify that the dual problem has a finite solution. To 
see this, note that tr(y) < cIa due to the constraints in the dual problem, 
hence, the maximum cannot exceed d^ for normalized states. Moreover, we 
can easily construct a primal feasible solution with Zab (SD Ic > Pabc and 
pIb > Zg. Hence, strong duality (Theorem 2.6) applies and the primal 
and dual solution are equivalent. 

The primal problem can be rewritten by noting that the optimization 
over p corresponds to evaluating the oo-norm of Zg. 

^max(^l^)p = logmin [\\Z4^: Zab ^ Ic > Pabc, Zab S ^(IK^s)} . 

(4.9) 

This can be used to prove upper bounds on the max-entropy. For 
example, the quantity H^^^{A\B) — which is sometimes used instead of 
the max-entropy [Ren05] — is an upper to H^^^{A\B). 

H^^^{A\B)p := logmaxtr(m4si^ (g) Gb) > H^,,{A\B)p . 

(7 

This follows from (4.9) by the choice Zab = H^-^^, which is the projector 
onto the support of Pab- Note also that smooth versions of H^^^{A\B) 
and H^^^{A\B) are equivalent up to terms in the smoothing parame- 
ter [TSSRll]. 

Furthermore, Konig et al. [KRS09] showed that the max-entropy can 
be expressed as a min-entropy of the purified state. 

Lemma 4.1. Let p G S<{'J{abc) be pure. Then, 
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Proof. We show that 2-'^max(-4|B)/^^ _ 2~-'^min(^|C)/c?^ using the expres- 
sion (4.6) for the min-entropy and Def. 4.2 for the max-entropy, i.e. we 
will show that 

max\\^/f^^^/^uWo^\\-. = max max 11 ^//u^^^^VrA^y^Tb^lL 

^ M V "i- c ^ C'C" T " ^ "J- 

where the maximization is over quantum states a G S^CKg) and Tq" G 
S<{Jic") as well as embeddings C — )• C'C". 

Due to Uhlmann's theorem (3.3) and the fact that the fidelity cannot 
decrease under partial trace, we have 

II ^TpI^^/t^a «> o-fl II 1 = max \^ p abC c" ^ 1 ac ® •^bc"\-^ 

" " J^ C ^ C'C" " "^ 

< max max || v/Pac'c'VTac ® '^cii , 
C ^ C'C" T " ^ "J- 

where |(/3) is any purification of a and the fully entangled state I7) purifies 
the fully mixed state vr by definition. Since this holds for every a, it 
particularly holds when a maximizes the fidelity in the definition of the 
max-entropy. On the other hand, 

\-s/Pac'c"^1ac' ® Tc" 111 = max || ^ypABc'c" VIac' '^ fsc" \\ , 

M 

< max \\^/Jhu3^/^L^W(T^L. 

Since this holds for all embeddings C — >• C'C" and all states r, it particu- 
larly holds for the tuple that achieves the maximum fidelity in (4.6). Thus, 
we established the equality of the two expressions. D 

4.2.3 The von Neumann Entropy 

For completeness, we also define the von Neumann entropy for sub-normalized 
states. 

Definition 4.3. Let Pab G '5<(IK^s). Then, the von Neumann entropy is 

H{A\B),:=H{p^s)-H{ps), where H{p) := -ti{p\ogp) . 

We may rewrite this as an optimization problem. For any a G S<:{'Kb) 
with tr((T) < tr(/9), we find 

H{A\B)p = tr(/j^s (1a (g) log CTs -logp^s)) - tr (ps(logo-B -logps)) 
> tr (pas ( 1a (g) log ag - log Pab))- 

We used that the second term, the relative entropy tr(/9s(logcrs — log/Os)), 
is non-negative thanks to Klein's inequality [Kle31]. Furthermore, the term 
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vanishes for the choice a^ = p^. If we optimize the expression over all 
o" G S<{!Kg) with tr(o-) < tr(p), we therefore get 

H{A\B)p = maxtr {pAsi^A «> logcJs - logp^s))- (4.10) 

We have seen that the min- and max-entropy satisfy a duality relation 
in Lemma 4.1. Here, we show that the von Neumann entropy also satisfies 
a duality relation. Note that H{p) = ^^ Aj log y: is a function of the 
eigenvalues {Aj} oi p only. Due to the Schmidt decomposition of any pure 
state Pabc, it holds that 

HiA\B)p = H{pab) - H{ps) = H{pc) - H{pAc) = -H{A\C) . (4.11) 

4.3 Relation to Classical Entropies 

Here, we evaluate the min- and max-entropy for states on two classical 
registers, X and Y . Their content is described by a joint probability 
distribution Pxy encoded in the state 



PxY = ^ Pxy{x, y) \x){x\x 



x,y 



4.3.1 Classical Min-Entropy 

In order to evaluate the min-entropy H^^^j^(X\Y)p = H^-^^{X\Y) p for this 
state, we consider the SDP for the min-entropy. We use the operators 

'^Y = Y1 Pxrixl y) \y){y\ and x^b = Y^ k!)(a;^| ^ \y){y\ , 



where xl is the x that maximizes PxY{x,y) for a given y. It is easy to 
verify that ay is primal feasible and X^b is dual feasible. Since they both 
give the same (upper and lower, respectively) bound on the min-entropy, 
we get the equality 

H^^{X\Y), = -\ogY,PxY{xty) = -log5;P^(y)2-^..inW^^ 

y y 

(4.12) 

where H^:^^{X)pv = logmax^, Px{x) is the min-entropy of X evaluated for 
the conditional probability distribution Px{x) = PxY{x,y)/PY{y). 

4.3.2 Classical Max-Entropy 

For the max-entropy H^g^^{X\Y)p = H^^^{X\Y)p the calculation is a bit 
more involved. First, note that we can assume that aY in the optimization 
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of Def. 4.2 is classical in the same basis as py If the fidelity F{pxY, Ix^cry) 
is maximal for a given ay, we can always measure both py and ay in the 
classical basis of py This operation cannot decrease the fidelity due to (3.2) 
and the measured state ay thus achieves the optimum and is of the form 

(Ty = y^Qviy) \y){y\ , where Qy is a probability distribution. 

y 

Hence, we maximize over all probability distributions Qy so that 

H^,,{X\Y), = max log ( ^ y^PM^)VQAy)f ■ 

x,y 

It is straightforward to verify — for example using the method of La- 
grange multipliers — that the optimal Qy is proportional to the map y i— ;■ 
( Ylix \/^xY{x,y)) ■ Hence, if we insert the (normalized) optimal Qy into 
the above equation, we get 

H^,^{X\Y), = log[Y.[Y.VPxAx,y)f) = log5^P,(y)2^-aJX)p.^ 

y X y 

where H^g^^{X)py = 21og^^, \/ Px{x) = Hy^{X)py is the Renyi entropy 
of order 1/2. 

4.4 Guessing Probability 

The classical min-entropy H^:^^{X\Y)p in (4.12) can be interpreted as a 
guessing probability. Consider an observer with access to Y. What is 
the probability that this observer guesses X correctly, using his optimal 
strategy? The optimal strategy of the observer is clearly to guess the x 
with the highest probability conditioned on his observation y. As before, 
we denote the probability distribution of x conditioned on a fixed y by P^ . 
Then, the (average) guessing probability is given by 

Y^Pyiy) max Pi (x) = 2-^n.in(^l^)p (4. 13) 

y 

It was noted by Konig et. al. [KRS09] that this interpretation of the 
min-entropy extends to the case where Y is replaced by a quantum system 
B and the allowed strategies include any measurement of B. 

Consider a CQ-state pxB = Sa, -^'^(^)I^X^I ® ^% with o;^ G S={^b)- 
For states of this form, the expression in (4.6) simplifies to 

2-^n.i„(^|s)p = max(r^s'EPx(x) \x){x\ ® t[u%]\TxB') 

X 

= maxN^Pjf (x) (x|£[a;g]|x) . 

X 
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This expression reaches its minimum when £ is a measurement, i.e. a map 
from S to a register 'Jig' = 'Jix- Moreover, the expression optimizes the 
probability that S-[u}%] is mapped to \x) over ah such measurement. We can 
interpret \x)b' as the observer's guess of the value x and thus 2~^min(^l^)p 
is the guessing probability in the sense described above. ^ 



4.5 Properties of the Min- and Max-Entropy 

Many properties of the min- and max-entropies (and their smooth variants, 
introduced in Section 5.2) can be derived from properties of the SDP for 
the min-entropy, Eq. (4.5). Let us consider the functional ^a\b '■ Pab *-^ 
2~^min(^l^)p^ given by the SDP, which we extend to arbitrary Hermitian 
arguments K G T-LCKab)- We use the shorthand notation $ = ^a\b in the 
following. We restate the SDP here for the convenience of the reader. 

primal problem dual problem 



minimize: (1b,(Tb) maximize: {Kab^X^b) 

subject to : "^a® '^b'^ K^b subject to : tr^(X^s) < 1^ 



Ob G Vi-K^) Xab G Vi-K 



AB ) 



We may express the min- and max-entropies of A conditioned on B 
of a pure state p £ S<{'Kabc) as follows. The min-entropy is given as 
H^-^{A\B)p = -log$A|s(/OAs) and the max-entropy as H^^^{A\B)p = 
\og^A\ciPAc)j by duality. The most important properties of the functional 
<1> are listed in Table 4.1. 

Some properties can be readily verified by close inspection of the SDP 
and we will only provide their proof when necessary. 

Proof of Property in. The inequality follows from the fact that dual fea- 
sibility of Xab is independent of the argument of ^. Hence, if Xab is 
optimal for K + L,we have ^{K + L) = {K, X) + {L, X) < ^{K) + $(L). 
For equality, consider the optimal primal for K + L, which we denote 
as- Then, n^SfTsII^^ is primal feasible for L and H^^aB^^^ is primal 
feasible for K. Hence, if these projectors are orthogonal, we have 

^{K) + $(L) = tr((n^s + H^s)cJs) < tr(fT) = <^{K + L). D 

Proof of Property vi. Let 0"^ be the optimal primal for M. Then, £,[a] is 
primal feasible for £[M] since 

1a^(Jb> Mab ^ tA® £[as] > £[Mab] . 



^Note that the special case when B is classical is recovered by setting £ : |y) i— >■ \xi). 
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Let K,Le 'H{'Kab) and M e V{'Kab). 

i. Multiplication with scalar: Let A > 0. Then, $(AK) = \^{K). 

ii. Monotonicity: K> L => ^{K)>^{L). 

iii. Sub-Additivity: (^{K + L) < (^{K) + (^[L). Furthermore, equality 
holds if Kg and Lg have orthogonal support. 

iv. Convexity: ^{\K + {1 - \)L) < X<^{K) + (1 - A)$(L) for A G [0, 1]. 

V. Invar iance under Isometrics: <1> is invariant under local isometrics 
on the AoT B system. Namely, ^a'\b' {{U(E)V)K{U'<0V'<)) = ^{K), 
where U : "K^ — )• ^a' and V ■ ^b — ^ ^s' are isometrics. 

vi. Data Processing: Let £ be trace non-increasing CPM from B to B' 
and let 3" be a sub-unital CPM from A to A' (3"[1a] <^a')- Then, 
we have $A|B'(£[Af]) < $(M) and <^a'\b{3'[M]) < $(M). 

vii. Data Processing on Extension: Let £ be a trace non-increasing 
CPM from C to C". Then, $(£[M]) < $(M). 

viii. Tensor Product: $a^a,|s^s,(L (g) K) = $_4^|bi(L) $a,Is,(K). 
Table 4.1: Properties of ^. 

Moreover, tr(£[(7]) < tr(cr), concluding the proof of the first statement. To 
prove the second statement, note that a is primal feasible for 3'[M] since 

tA®aB> Mab =^ 1a' ^aB> S'IIa] ® (Tb > 3'[Mab] , 

where we applied the defining property of sub-unital maps. D 

Proof of Property vii. Lemma A.l establishes that trc(£[M]) < tr c{M). 
The property then follows from monotonicity. D 

Proof of Property viii. If aB^ is primal optimal for L and iTb^ is primal 
optimal for K, then aB^ (E> cts^ is primal feasible for L ® K. Moreover, 
if Xa-^^bi is dual optimal for L and Xa^b^ is dual optimal for K, then 
Xa^bi ®Xa2B2 is dual feasible for L®K. Hence, the equality follows from 
properties of the inner product. D 

4.5.1 First Bounds and Order 

We first show the following technical lemma. 

Lemma 4.2. Let K G 1-L{'Kab), d*A = rank{trs{i^}+} < d^ and d% = 
rankjtr^l/'i'}-!-} < (Ib- Then, 

;^tr{K}+ < ^K) < mm{d*^,dl}tr{K}+. 

"■A 
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Proof. Using the eigenvalue decomposition K = Yli'^i IV'^Xv^'l ^^^ Prop- 
erties i. to iii., we find 

^ i:Ai>0 ^ i:Xi>0 

Since the 93* are normafized and pure, we have, 95* < Ia ^X" H'^b and, 
thus, (Ts = U.'^B is primal feasible. This means that $(v*) < tr(n'^s) = 
tr(IP'^) < ioam{d\, d*g} , where the equality follows from the Schmidt de- 
composition and the latter inequality from 

suppj^Jg} C supp{trA{K}+} and suppjc^?^} C supp{trB{K}+}. 

This concludes the proof of the upper bound. 
On the other hand, we use that 

d\ d\ 

is dual feasible. Hence, <^{K) > tr(KAsX4s) > ;^tr{i<:}+. D 

"•A 

Using these properties, we now establish bounds on the conditional 
min- and max-entropies in terms of the support of the marginals of the 
state. This also establishes that — for normalized states — the min-entropy 
is always smaller than the von Neumann entropy, which is in turn smaller 
than the max-entropy 

Proposition 4.3. Let Pab G S<{^ab), d\ = rank{/9A}; d*g = rankjps} 
and d\g = ranklp^s}. Then, using t = l/ti{p), the following hounds 
hold: 

-\ogm.\Ti{d\,dl} < H^^^{A\B)^ - logi 
<tH{A\B)p 
< ^max(^l^)p + logt < \ogmm{d\,d\s}. 

Proof The lower bound on the min-entropy follows directly from Lem- 
ma 4.2. To get the second inequality, we consider the tuple {(Ts,A} that 
optimizes the min-entropy in Def. 4.1. Hence, 1^ ^ ^^{p)o'b > i^{p)'^^PAB- 
We now plug tY(p)aB into (4.10), providing us with a lower bound on the 
von Neumann entropy. 

H{A\B)p > tr (pas(1a (S) log (tr(p)cjB) - logp^s)) 

> tr {pab {log {tr{p)2^pAB) - log Pab)) 
= tr(p)(A + logtr(p)), 
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where we used the operator monotonicity of the log function (cf. Sec- 
tion 2.3.2) to get the second inequahty. This estabhshes the second in- 
equahty of the statement of the lemma. 

The third and forth inequality follow symmetrically from the first 
two inequalities applied to the marginal state Pac of a-ny purification 
p € S<{^ABc) of p together with the duality relations H^g^^{A\B) = 
-H^-^{A\C) and H{A\B) = -H{A\C). Note also that d*c = d\g due 
to the Schmidt decomposition. D 

For example, the above lemma implies that the min- and max-entropy 
of normalized pure states is at most zero and that the min-entropy cannot 
exceed the max-entropy for normalized states. 

4.5.2 Continuity 

For classical-quantum states, the operational interpretation of the condi- 
tional min-entropy as a guessing probability (cf. [KRS09]) already implies 
its continuity in the state. To see this, note that a discontinuity in the 
guessing probability could be detected experimentally using a fixed num- 
ber of trials (the number depending only on the required precision) , hence 
giving us the means to distinguish between arbitrarily close states for a 
cost (in terms of the number of trials) independent of their distance. For 
sufficiently close states, this would contradict the upper bound on the 
distinguishing advantage, Eq. (3.1). Here, we make this statement more 
precise. 

Proposition 4.4. Let p,T £ S^CKab) and 6 := D{p,t), then 

dArcim{dA,dB} 



H^^iA\B),-H^,iA\BU<6 



In 2 • min{tr(p), tr((j)} 

Proof. We use continuity of the functional <1> to obtain 

$(r) = $(/) -F (r - p)) < $(p) + $(r - p) 

< <^{p) + mm{dA, dg} trjr - p}+ < $(/)) + minjd^, dg} 6 . (4.14) 

Note that $ > for all states in S<{!Kab)- Taking the logarithm and using 
the bound ln(a -|- x) < In a -|- -, we find 

1 ^^ ^ 1 ,. s , .mini dA,(iB} . .c?Amin{dA, (is} 
log^(r)-log^(p)<^ ln2.cl>(p) ^^ ln2-tr(p) " 

The same argument also applies on exchange of p and r and we obtain the 
statement of the lemma by substituting H^^^{A\B)p = — log <&(/)). D 
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Remark 4.5. The above result is tight in the following sense: Consider a 
system with Hilbert spaces "K^ and "Kg = !K^'©CKs', where "Ka' — ^a- Let 
Jab be the normalized fully entangled state on 'Ka^'^^a' o,nd Gb G (S<(5{s') 
he orthogonal to jg . The choice p = t^a® <7b o,nd t = p + 5j for some 
small (5 > leads to D{p, r) = 5, 

iv(p) 
$(p) = — ^ and $(r) = $(/)) + 6 mm{dA,dB}. 
dA 

Taking the logarithm (for small 5) leads to 

^ ^t \ } ris^ ^ ^ x™^'^'L^'4,dfl} dAmm{dA,dB} 

log^(r)-logcI>(p)^J ln2-c|>(p) =^ ln2-tr(p) " 

Proposition 4.4 implies that the conditional min-entropy is uniformly 
(Lipschitz) continuous on the set of normalized states and in any e-ball. 
Since D{p,t) < P{p,t), the bound also holds for 6 = P{p,t). Due to the 
duality between min- and max-entropy, Lipschitz continuity in P{p, r) also 
follows for the max-entropy. 

4.5.3 Conditioning on Classical Information 

Let us, more generally, consider a general state between two quantum 
systems, A and B, and a classical register, K. Such a state has the form 

pABK = ^Pk \k){k\ T^g, where r*^ G S<{:Kab) (4.15) 

k 

and {pk} is a probability distribution. 

Proposition 4.6. Let p G S<:{'Kabk) be of the form (4.15). Then, 

H^^{A\BK), = -\og{Y,Pk 2-^-n(^l^).'=) and 

k 

H^,M\BK)p = log ( Y^pk 2^n...(A|B)^,)_ 

k 

Proof. The fist statement directly from the sub-additivity of $, Property 
ii., and the fact that the marginal states on the conditioning system, r^ C3 



\k){k\, are orthogonal. Thus, 

^a\bk{p) = YPk ^a\bk{t'' \k){k\) = Ypk ^aib(t^). 
k k 

Note that we can remove the trivial register K in the last expression since 
<1> is invariant under the isometry r i— >• r iS" \k){k\. 
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The second statement follows by the duality relation of the min- and 
max-entropy (Lemma 4.1). We consider purifications of t^ on C and in- 
troduce a purification of p as 

\p)abckk' =T.Vp^\^\bc ® I^)k ® \k)K' . (4-16) 



ij^, = "Kji. Using this state and the fact that its marginal Pack' is 
of the form (4.15), we find 

H,,,,{A\BK), = -H^^{A\CK')p = log ( j;pfc2-^-n(^|C)..) 

k 

= log(^Pfc2^n-(^l^).^). D 



4.5.4 Concavity of the Max-Entropy 

The following bounds follow directly from the above lemma and the con- 
cavity of the log function. For a state of the form (4.15), we have 

H^^{A\BK)p <Y,Pk H^^{A\B),, and 

k 

H^,M\BK)p > Y,PkH^..{A\B),.. 

k 

Moreover, H^^^iA\B)p = log<l>AicKK'ip) > log^A\cK'ip) = H^^^{A\BK)p 
due to data processing of $ applied to the purification of p given in (4.16). 
Together with the fact that Pab = '^kPk'''AB^ this implies that the max- 
entropy is a concave function of the state. 

Furthermore, we have seen that ^a\b (which corresponds to the guess- 
ing probability if A is classical) is convex in the state. However, numerical 
evidence suggests that the min-entropy itself is neither concave nor convex 
in the state. 
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Chapter 5 

Smooth Entropies 



Smooth entropies are defined as optimizations of the min- and max-entropy 
over a set of close states. In this chapter, based on [TCRIO], we propose 
that this closeness should be measured in terms of the purified distance. 
This endows the smooth entropies with many useful properties, for example 
invariance under isometries and a duality relation. Among many other 
properties, we also show that the smooth entropies satisfy a data processing 
inequality and chain rules. 



5.1 Introduction and Related Work 

Generally, smooth entropies are defined as optimizations of the underlying 
entropy over a ball of states close to the state under consideration. In 
particular, the smooth min- and max-entropies are defined as 

HLni^\B)p = maxH^^^iA\B)p and H'^^^{A\C)p = min H^^^{A\C)p, 
p p 

where the optimization is over an e-hall of states p close to p. The con- 
ditional smooth entropies for quantum states are first introduced by Ren- 
ner [Rcn05]. In this work, various properties of the smooth entropies are 
shown, among them chain rules and various data processing inequalities. 

It is important to note that the metric used in [Rcn05] to define the 
e-ball was a generalization of the trace distance to sub-normalized states. 
Subsequently, we found that a metric based on the fidelity would be more 
suitable [Wul08]. Using such a metric, we can always find extensions, due 
to Uhlmann's theorem, that are as close as their marginals. This property 
does not hold for the trace distance metric used in [Ren05]. Note also 
that Renner used a max-entropy based on the Renyi entropy of order as 
a basis for the smooth max-entropy and was thus unable to harness the 
duality relation, which simplifies many proofs. 
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5.1.1 Main Contributions 

Using the purified distance as a metric, we are able to show many properties 
of the smooth entropies in a much more direct way as compared to [Ren05] . 
In particular, this smoothing allows us extend the duality relation of the 
min- and max-entropy (cf. Lemma 4.1) to e-smooth entropies. 

Result 3 (Duality Relation). For any e >0 and Pabc pure, we have 

Another result we want to highlight is a data processing inequality. It 
states that the uncertainty about the A system gets at most larger when 
we condition on less side information. 

Result 4 (Data Processing Inequality). For any e > 0, we have 

Hl,^{A\BC)p < Hl,^{A\B)p and Hl,^{A\BC)p < H^^,,{A\B),. 

This is a special case of a more general data processing inequality that 
holds for any TP-CPM applied to the system we condition on. 

This section also includes a variety of other results. For example, var- 
ious properties of smooth entropies of classical registers are discussed and 
should be very helpful in many applications. For example, we show that 
the smoothing on registers can be restricted to valid classical states with- 
out loss of generality. This implies that the smooth entropies are also 
well-defined in their classical limit. 

5.1.2 Outline 

In Section 5.2, we introduce the smooth min- and max-entropies and show 
several properties of the smoothing. Section 5.3 discusses some properties 
of the smooth entropies, including their invariance under isometrics and a 
duality relation. One of the most striking properties, a generalized data 
processing inequality, is shown in Section 5.4. Then, Section 5.5 explores 
the properties of the smooth entropies of classical registers. Finally, Sec- 
tion 5.6 summarizes some chain rules that were recently proven for the 
smooth entropies. 

5.2 Smooth Min- and Max-Entropies 

The smooth entropies of a state p are defined as optimization over the min- 
and max-entropies of states p ~£ p, i.e. over states that are close to p. Here, 
we define the smooth min- and max-entropies and explore some properties 
of the smoothing. In particular, we show that our definition of smoothing 
allows to extend the duality relation of the min- and max-entropy to the 
smooth entropies. 
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5.2.1 The Smooth Entropies 

We introduce sets of e-close states that will then later be used to define 
the smooth entropies. 



Definition 5.1 (e-Ball). Let p G 5<(J{) and < e < Y^tr(p). We define 
the £-ball of operators on !K around p as 

B'i^K; p) := {r G S< {'K) : P{t, p)<e]. 

Furthermore, we define the e-hall of pure states around p as Bp{'K; p) := 
{TGi3^(:K;p) :rank{T} = l}. 

For the remainder of this chapter, we will assume that e is sufficiently 
small so that e < Y^tr(p) is always satisfied. Furthermore, if the Hilbert 
space used is obvious from context, we will omit it and simply use the 
notation B^{p). We now list some properties of this e-ball that come in 
addition to the properties of the underlying purified distance metric. 

i. The set B^[^;p) is compact and convex. 

Proof. The set is closed and bounded, hence compact. For convexity, 
we require that, for any A G [0, 1] and o", r G B^{p), the state to := 
Ao" + (1 — A)t is also in B^{p). We define cD = w © (1 — trcj) and 
analogously p, a and f. Thus, Co = Aa + (1 — A)f by linearity. By 
assumption, we have F{a, p) > Vl — e^ and F{f, p) > \/l — e^. We 
use the concavity of the fidelity (cf. [NCOO], Section 9.2.2) to find 

P{u:,p) = ^l-F{(orpY 

= ^/l-F{\a + {l-\)f,pf 



< Jl-(AF(a,p) + (l-A)F(f,/}))'<e. 



Therefore, uj G B^{p), as required. D 

ii. Normalized states in B'^{'K; p) are not distinguishable from a normal- 
ized state p with probability more than 2(1 + e). 

Proof. By Proposition 3.3, r G B^{p) implies D{t,p) < P{t,p) < e. 
The statement then follows from (3.1). D 

iii. The ball grows monotonically in the smoothing parameter e, namely 
e < e' ^ B^i^i^p) C S"'(:K;p). Furthermore, S°(:K;p) = {p}. 

The smooth entropies are now defined as follows. 
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Definition 5.2 (Smooth Entropies). Let p £ 5<(!K^s) and e > 0. Then, 
we define the e-smooth min- and max-entropies of A conditioned on B of 
the state p as 

H^:,^{A\B)p := maxH^^^{A\B)p and 
p 

^max(^l^)p := mini7^,,(A|S)p, 
p 

where the optimization is over all states p E B^{pab) in both cases. 

Note that the extrema can be achieved due to compactness of the e-ball 
(cf. Property i.). We usuahy use Pab to denote the state that achieves the 
extremum, e.g., for the min-entropy, there exists a state Pab £ S^iPAB) 
such that Hf^-^^{A\B)p = Hj^^^{A\B)p. The state p is e-indistinguishable 
from p in the sense described in Property ii. Moreover, the smooth min- 
entropy is monotonically increasing in e due to Property iii. of the e- 
ball and we recover the non-smooth min-entropy in the hmit e = 0, i.e. 
H^^^^^{A\B)p = H^^^^{A\B)p. Similarly, the smooth max-entropy is mono- 
tonically decreasing in e and H^^^{A\B)p = H^g^^{A\B)p. 

If Pabc is normalized, the optimization problems defining the smooth 
min- and max-entropies can be formulated as SDPs. To see this, note that 
the restrictions on the smoothed state p are linear in the purification Pabc 
of p. In particular, consider the condition P{p, p) < e on p, or, equivalently, 
F'^iPiP) > 1 — e^. If Pabc is normalized, then the squared fidelity can be 
expressed as F'^{p,p) = tr{pABc Pabc)- 

We give the primal of the SDP for 2^^n^ini^\^)p as an example. This 
SDP is parametrized by an (arbitrary) purification p € S={'J{abc)- 

primal problem 



minimize: (ILb,o"b) 
subject to : 1a (8 cr^ > tro(pAsc) 
tr^pABc) < 1 
tic{pABcPABc) > 1 - e^ 
Pabc G Vi^ABc) 
(Tb G Vi'Ks) 

This program allows us to efficiently compute the smooth min-entropy. 

5.2.2 Remarks on Smoothing 

For both the smooth min- and max-entropy, we can restrict the optimiza- 
tion in Def. 5.2 to states in the support of Pa® Pb- 

Proposition 5.1. For any state p G S<{'Kab), there exist optimal states 
p,p£ B^{pab) msupplpAlf^supplps} such that H^-^XA\B)p = H^-^^{A\B)p 
and H^^,{A\B)p = H^,^{A\B)p. 
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Proof. Let Pabc be any purification of p on C, where Jic — "^ab- More- 
over, let n = liP^ (g) n^s be the projector onto the support of Pa ® Pb- 
Recall that we can express the min- and max-entropies in terms of the 
functional $, i.e. 

H^^{A\B)p = -\og^{p) and H^.^^{A\B)p = log ^^^dp). (5.1) 

For the min-entropy, first consider any state p' G B^CKab, Pab) that 
achieves the maximum in Def. 5.2, i.e. H^:^^{A\B)p = —log $(/?'). We 
have ^{p') > ^{Hp'H) due to the data processing property of $ (Prop- 
erty vi.). This implies that projecting onto 11 will not decrease the min- 
enropy. Moreover, since p = HpH, we find P{Ilp'Il, p) < P{p',p) < e due 
to the monotonicity of the purified distance under trace non-increasing 
maps. Hence, p = lip'Yi G B^{pab) and, thus, necessarily H^^j^{A\B)p = 

For the max-entropy, consider a state p' G B^{pabc) whose marginal 
achieves the minimum in Def. 5.2. (Note that every Pab £ S<{'Kab) has a 
purification in this e-ball around Pabc due to Theorem 3.5.) Thus, 

Hi,,^{A\B)p = \og^ aAp) > iog$.4|c(n^-V'n''^) > iogci>^|e(np'n). 

Here, the first inequality follows from the data processing property of <1> 
(property v.) and the second one from data processing on the purifying 
system (Property vii.). Since p = Up'H £ B'^{pabc)i we get H^^^[A\B)p = 
Hl,M\B)p. □ 

Note that these optimal states are not necessarily normalized. In fact, 
it is in general not possible to find a normalized state in the support of 
Pa ® Pb that achieves the optimum. However, if p is normalized, we can 
always find normalized optimal states if we embed the systems A and B 
into large enough Hilbert spaces that allow smoothing outside the support 
of Pa® Pb- For the min-entropy, this is intuitively true since adding weight 
in a space orthogonal to A, if sufficiently diluted, will neither effect the min- 
entropy nor the purified distance. For the max-entropy, the result follows 
from the duality of the entropies. 

Lemma 5.2. Let p G 5=(IK_4b). Then, there exists an embedding from "Ka 
to 'Ka' and a normalized state Pa'b S B^{pa'b) such that H^:^^{A'\B)^ = 
H^^^{A\B)p. Moreover, there exist embeddings 'Ka to "Ka' and "Kb to Kb' 
and a state Pa'b' £ B^{pa'b') such that H^^^{A'\B')p = H^^^{A\B)p. 

Proof To prove the property for the min-entropy, let {pabjO'b} be such 
that they maximize the smooth min-entropy A = H^^^{A\B)p, i.e. we have 
Pab < 2~^tA'Si(TB- Then we introduce an auxiliary Hilbert space Ka with 
dimension dA to be defined and embed Ka into Ka' — Ka(BKa- The state 
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Pa'b = Pab ® (1 — tr p) vr^ (5D cjs, where vr^ = Ia/^a, satisfies the required 
conditions. We have 

Pa'b = Pab ® (1 - tr p) vr^ (^ cr^ < 2^^(11^ ® 1^) (g) cjb 

if 2 (1 — ii p) < 2 e < d^, i.e. if d^ is chosen large enough. This imphes 
that the H^:^^{A'\B)^ of p is at least A. Moreover, F{p, p) = F{p, p) is not 
affected by adding weight into an orthogonal subspace. 

The equivalent statement for the max-entropy follows by duality. For 
any purification Pabc-, we have H^^^[A\B) p = —H^:^^{A'\C)p. The state p 
has a purification on B' (in fact, this defines the embedding) that is e-close 
Pa'b'c- Hence, the result follows. D 



5.3 Properties of the Smooth Entropies 

5.3.1 Invariance under Isometries 

The e-smooth min- and max-entropies are independent of the Hilbert 
spaces used to represent the density operator locally, as the following 
lemma shows. 

Proposition 5.3. Let e > and p^^ G S<{'Kab)- Then, for all emhed- 
dings "K^ — >• ^a' 0''nd "Kg — t- ^s', the embedded state Pa'b' satisfies 

-f^min(^l-S)p = H^ini'^'\B')p and H^^^{A\B)p = H^^^{A'\B')p . 

Proof. Let U : "Ka — )• ^a' and V : "Kb — ?• 3^^s' be above-mentioned em- 
beddings and let Pabc be a purification of p. We also introduce the state 
Pa'b'c = {U ® V)pabc{U'^ ® W), which purifies Pa'b'- 

We first consider the special case e = 0. The property for the min- 
entropy then follows directly from the invariance of $, i.e. ^a\b{Pab) = 
^a'\b'{Pa'b')- For the max-entropy, we use ^a\c{Pac) = ^a'\c{Pa'c), which 
establishes the equivalence of the max-entropies expressed in terms of the 
purifications Pabc and Pa'b'c-, respectively. 

To extend this to e > 0, we apply the following argument separately 
to the statement for the smooth min-entropy and the statement for the 
smooth max-entropy. The argument for the min-entropy goes as fol- 
lows. We first introduce states Pab £ B^{s^w{Pa ® Pb}'-, Pab) and Pa'b' £ 
i3'^(supp Ipa' (^ Pb']] Pa'b') that maximize H^;^^{A\B) and H^:^^{A!\B'), re- 
spectively (cf. Proposition 5.1). Since the purified distance is non-increasing 
under trace non-increasing CPMs (cf. Theorem 3.4), we immediately find 
that 

PA'B' ={U® V)Pab{U^ ® V^) G B'{^a'B';Pa'B') 
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is a candidate for the optimization of Hf^^^{A'\B'). Hence, 

-f^min(^'l^')p > H^miA'\B')p = H^i^{A\B)p = H^i^{A\B)p . 

The same argument appUes in the converse direction. There, we define Pab 
as the pre-image of Pa'b' under the isometry U ®V . (This is possible since 
pA'B' lies in the support of p^' ® Pb' and, thus, in the image of [/ (8> V .) 
This establishes 

-f^min(^l^)p > H^^^{A\B)p = H^^^{A!\B')p = H^:^^{A\B)p . 

Therefore, equality holds. D 

Note that the above extension of a non-smooth argument to a smooth 
argument is quite generic and only relies on the monotonicity of the purified 
distance under trace non-increasing maps. We will often use variations of 
the above technique to lift proofs for e = to e > 0. 

5.3.2 Duality of Smooth Entropies 

The duality relation in Lemma 4.1 extends to smooth entropies. 
Theorem 5.4. Let e > and let p G S<{'Kabc) be pure. Then, 

Proof. According to Proposition 5.3, the smooth entropies are invariant 
under embeddings. Let Pabc' be an embedding of Pabc iuto a space "Kab® 
"Kc' with dim{^c'} = maxjdc, ^as}- Then, 

Hl^{A\B)p= _ min H^,^{A\B)p = _ min H^,^{A\B)p 

pdB^ipAB) P<^B'p(pabc') 

= -. max H^^^{A\C')p >-_ max H^^^{A\C')p 

P&BpiPABC') p^^Hpac) 

= ~-f^mm(^|C' )p = —H^i^{A\C)p . 

We used that "Kc' is chosen large enough to accommodate all purifications 
of states p G B^{pab) (cf. Theorem 3.5). However, this also means that 
not all purifications of states pAc' ^ ^^{Pac) can be found in B^{pabc')i 
which leads to the inequality. 

On the other hand, we may consider the embedding Pab'c iuto a space 
"Kac ® ^B' with dim {1Kb'} = niax{dB,dAc}- Then, using the same argu- 
ments as above, 

H'^,,{A\B)p = H'^,^iA\B')p=_ min H^,,{A\B')p 

P&B^ipAB') 

-- «"}™ ,-^max(^l^')p =-, max H^^^{A\C)p 

P^BI,{Pab'c) P^BlipAB'c) 

= - max H^i^{A\C)p=-H'^i^{A\C)p. D 

peB^(PAc) 
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5.3.3 Relation between Smooth Entropies 

We have established in Proposition 4.3 that the min-entropy cannot exceed 
the max-entropy for normahzed states. Here, this result is extended to 
smooth entropies. (See also [VDTRll] for an alternative proof, which 
inspired the one provided here.) 

Proposition 5.5. Let p G 5=(IK^s) and e,e' >0 s.t. e + e' < 1. Then, 



P - -maxV-|-;p ^ ^-& 1 _ (£ + ^/)2 " 

Proof. First, note that we can always embed !K^ into a larger space ^a' 
such that there exists a normalized state Pa'b G ^^{Pa'b) that maximizes 
iJ^i„(A'|5) = H^^^^{A\B) (cf. Lemma 5.2). Similarly, by duality, there 
exists a normalized state Pa'b' £ ^^ {Pa'b') that minimizes H^^^{A'\B') = 
H^g^^{A\B) . The system B' contains B and the purification of the addi- 
tional weight introduced to normalize the dual smooth min-entropy. 

Hence, by definition of the smooth min-entropy, there exists a state 
a G S={J{b') such that Pa'b' < 2"^!^' ® cfb' with A = H^^-^{A'\B')p. 
Thus, 

H^,^{A\B), = Hi,^{A\B'), > log \\VP^VtA'(^^B'\\l 

> X + log\\^/ Pa' B'V Pa' B'W-^ = A + log(l - P'^ipA'B', Pa'b')) 

The first inequality follows from the definition of the smooth max-entropy, 
together with the fact that we took a particular a instead of optimizing 
over all a. The second inequality is a simple application of the operator 
inequality that defines H^-^^{A'\B). Finally, the triangle inequality for the 
purified distance establishes P{p, p) < £ + s' . D 

Remark 5.6. The term — log (l — (e + e')^) '^'^'^ ^^ reduced by the use of 
Eq. (3.8) instead of the triangle inequality for the purified distance. Hence, 

Hl^^{A\B)p < Hi,,{A\B), + 21og ^ ^. 

1- (eVl-e'^ + e'VT^e^ 



The range of allowed pairs {e,e'} is extended to those satisfying arcsin(e) + 
arcsin(e') < | . /n particular, this means that the term is finite if we choose 
e' = 1 — £, for any < e < 1. (See also Figure 3.1.) 

Proposition 5.5 implies that smoothing states that have similar min- 
and max-entropies has almost no effect. More precisely, let p G S={'Kab) 
be such that H^^^{A\B)p = H^^^{A\B)p - 5. Then, 

H'^^{A\B)p < H^,,{A\B)p - log(l - E^) = H^^{A\B)p + 5 - log(l - e^). 
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For (5 = 0, this inequality is tight. Note that the smoothed state p = 
p(l — e^) ~£ p reaches equality in this case. A corresponding inequality 
can be derived for the smooth max-entropy. 

5.4 Data Processing Inequalities 

We expect measures of uncertainty about the system A given side infor- 
mation B to be non-decreasing under local physical operations (e.g. mea- 
surements or unitary evolutions) applied to the B system. Such operations 
can be described most generally by TP-CPMs. Here, we show that the 
smooth entropies, H^^^^(A\B) and Hf^^^(A\B) , have this property. 

Another data processing inequality concerns rank-1 projective mea- 
surements of the system A. Such measurements can be described in 
terms of an orthonormal basis {|x)} of "Ka and a measurement TP-CPM 
Ai G T{^Ai ^x) from !Ka to "Kx — "Ka, which maps p to J2xi^\p\^) I^X^I- 
We expect that the uncertainty about the system A as well as the smooth 
entropies, H^-^j^{A\B) and Hf^^^{A\B), do not decrease when such a mea- 
surement is executed on the A system. 

In fact, we show a more general theorem that encompasses the two 
example data processing properties above. (Note, in particular, that M is 
a unital map.) 

Theorem 5.7 (Generalized Data Processing). Lete > and p G S<:{^abc) 
Moreover, let £. be a sub-unital and trace non-increasing CPM from 'Ka to 
Ji'A, let 3' be a trace non-increasing CPM from 'Kg to "Kb' and let 9 be 
a trace non-increasing CPM from 'Kc to 'Kc' ■ Then, the state Ta'b'c' = 
(Sog"o £)[/)^sc'] satisfies 

Htnui{MB)p < i?min(A'|5')r- 

Furthermore, if £■, 3' and 9 are also trace preserving, then 
H^a.A^\B)p < H^,^^{A'\B')r . 

Proof. We first prove the result for the smooth min-entropy. Let p £ 
B^iPAs) be the state that maximizes the smooth min-entropy. Then, data 
processing of <1> (Properties vi. and vii.) implies 

= - log ^a\b{p) 
<-log^A'iB'{{3'oE.m) 

<-iog^A'iB'i{9o:ioE)[p]). 

We now introduce the state f = (9 o 3"° £)[/o], which is e-close to r due to 
the monotonicity of the purified distance under trace non-increasing maps 
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(cf. Theorem 3.4). Thus, we conclude 

To prove the result for the max-entropy, we take advantage of the 
Stinespring dilation of two TP-CPM maps £ and 3". Namely, we introduce 
the isometrics U : A^ A' A" and V : B ^ B'B" and the state Ta'a'b'b" = 
{U (E> V)pab{U^ ^ y^) of which Ta'b' is a marginal. Let f G B'^{ta'a"b'b") 
be the state that minimizes the smooth max-entropy H^^^{A'\B')t-- Then, 

Hma.A^'\B')T = max log II v^Ta'sVIa' f^o-s' 11 i 

II / / Ii2 

> maxlog VfvB'Vti^A"(nA'A") f^cTs' 1 • (5.2) 

We introduced the projector Ha'a" = UW onto the image of U, which 
exhibits the following property due to the fact that £ is sub-unital: 



tr^"(n) = tr A" (UIaU^) = £[1^] < 1 



■A' ■ 



2 



The inequality in (5.2) is then a result of the fact that the fidelity is non- 
increasing when an argument A is replaced by a smaller argument B < A. 
(Property v. of the fidelity). Next, we use the monotonicity of the fidelity 
under partial trace (3.2) to bound (5.2) further. 

^max(^'|-S')T > maxlog II \^fA'A"B'B" V^A'A" O CJb's" II 1 

= max log II y/UA'A"fA'A"B'B"'nA'A"V^A'A" ® (^B'B"\\ i 

= H^,,iA'A"\B'B"),. 

Finally, we note that the projector f = n-fll E B^{pa'a"b'b") due to the 
monotonicity of the purified distance under trace non-increasing maps. 
Hence, we established H^^^{A'\B')r > H^^^{A'A"\B'B")r = H^^^{A\B)p, 
where the last equality follows due to the invariance of the max-entropy 
under embeddings (Proposition 5.3). D 

Note that a generalization of the data processing results to trace non- 
increasing maps is not possible for the max-entropy. For example, the 
max-entropy can decrease when a projection is applied to the B system. 

5.5 Classical Information 

Here we discuss smooth entropies of quantum states that encode partially 
classical information. To maintain full generality in the following argu- 
ments, we consider states on a four-partite system composed of X, y, A 
and B, where X and Y are classical registers. The smooth entropies are 
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evaluated for the classical register X and the quantum register A condi- 
tioned on classical side information Y and quantum side information B. 

In the following, we say that the state p £ S<{JixYAB) is classical- 
classical- quantum- quantum (CCQQ) if it can written in the form 

PxYAB = '^Pxy \x){x\x |y)(y|y O c^As , where tj^s G S<{'Kab), (5.3) 
x,y 

the kets {|x)}, {\y)} are orthonormal bases of "Kx and !Ky, respectively, 
and {pxy} is a probability distribution. This state has the property that it 
remains invariant under a projective measurement of X in the basis {\x)} 
and under a projective measurement of Y in the basis {|y)}. Formally, we 
define measurement TP-CPMs 

Mx:p^^{x\p\x)^\x){x\x and My : p ^ ^ (y|/)|y)y|y)(y|y (5.4) 

X y 

and note that Mx[p] = My[/9] = p. Furthermore, given an arbitrary state 
r G S<{JixYAB), it is easy to verify that the measured state (My o Mx)[t] 
is of CCQQ form (5.3). 

Partially classical states of the form (5.3) can be purified in a way 
that preserves their structure. For this, we introduce purifying Hilbert 
spaces ^x' — ^x, ^y — '^y and 'Kc — "^ab- Then, one may purify 
the states w^s individually on S<{^abc)^ which allows us to write down 
a purification \p) G ^xx'i-y'asc of the form 

\p)xx'YY'ABC = Y^ ^fV^y\^)x ® k)x' ® ly)y ® ly)y' ® \^''^)abc ■ (5-5) 
x,y 

It is easy to verify that purifications of this form commute with the 
following projectors, 

Uxx' ■=Y\x){x\x(E>\x){x\x' and Hyy := ^ |y)(y|y(g)|y)(y|y' . (5.6) 

X y 

In the converse, any state r G S^^Kxx'yy'ab) that commutes with both 
Ilxx' and Ilyy' has a CCQQ marginal Txyab- We say that the two systems 
X and X' as well as Y and Y' are coherent classical pairs. 

Some important properties of min- and max-entropies of coherent clas- 
sical states are discussed in Appendix A, Section A. 2 and will be used in 
the following. 

5.5.1 Smoothing of Classical States 

First, we show that — in order to smooth min- and max-entropies — it is 
sufficient to consider a ball of close states that are classical on the same 
subsystems as the original state. This is formalized in the following propo- 
sition. 



81 



5. SMOOTH ENTROPIES 



Proposition 5.8 (Classical Smoothing). Let p G S<{JixYAB) be classical 
on X and Y and let e > 0. Then, there exist states p, p £ B^{p) that are 
classical on X and Y such that 

W^^{XA\YB)p = H^^{XA\YB)p and 
Hl,^{XA\YB)p = H^^,^{XA\YB)f, . 

Proof. We first prove the statement for the min-entropy. By definition 
of the smooth min-entropy, there exists a state p G S^{p) such that 
H^-^{XA\YB)p = H^-^{XA\YB)p. We propose the CCQQ state p = 
(Mjf o My)[p] using the measurement in (5.4) as a candidate. Due to the 
data processing inequalities in Theorem 5.7, we have 



H,,,^{XA\YB)p < H^^{XA\YB\ 



p ■ 



Furthermore, p ^^ P due to Theorem 3.4 and, hence, p satisfies all condi- 
tions of the proposition. 

To prove the statement for the max-entropy, we consider the dual prob- 
lem for the min-entropy using the purification \p) in (5.5). Namely, we need 
to show that there exists a state p £ B^{p) that satisfies 

H'^,{XA\X'Y'C)p = H^^^{XA\X'Y'C)p , 

commutes with Ilxx', and is classical on Y' . Due to the first part of this 
lemma, there exists a state p G t3^{p) such that H^^j^{XA\X'Y'C)p = 
H^^^j^{XA\X'Y'C)p and p is classical on Y' . We propose the state p = 
Hxx'P^xx' as a candidate. Then, Lemma A. 3 and Theorem 3.4 establish 
that p satisfies all conditions, which conclude the proof. D 

5.5.2 Entropy of Classical Information 

Here, we bound the smooth entropies of a classical system X. For full 
generality, we consider the state Pxab, which is simply the state (5.3) with 
trivial classical subsystem Y. 

Proposition 5.9. Let p G SkCKxab) be classical on X and let < e < 1. 

Then, 

H^^,,iA\B)p<H'^^{XA\B)p<H^,,{A\B)p + logdx and (5.7) 

H'^,,{A\B)p < Hl,,{XA\B)p < Hl,M\B)p + \ogdx. (5.8) 

Proof. To show the first inequality of (5.7), we consider the dual problem 
for the max-entropy. Namely, we need to show that 



Ht,,^{XA\X'C)p < H^^,,{A\CXX'), 
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for the state Pxx'ac that is coherent classical between X and X' . To 
do this, note that there exists a p £ i3^{p) with H^^^(A\CXX')p = 
H^^^{A\C X X') p that has support on supp {/o^l'^supp {pxx'c} (cf- Propo- 
sition 5.1) and, thus, commutes with Ilxx'- The first inequality now follows 
after we apply Lemma A. 4 to p and realize that 

H^^^^iXA\X'C)p < H^^jXA\X'C)p < H^,^{A\XX'C)p. 

Since smoothing can be restricted to states that are classical on X 
(cf. Proposition 5.8), there exist states p G S^ip) and a G S={!Kg) such 
that 

PxAB = ^Px \x){x\x f^B < 2-^min(^^|s)p i^^ ^ ^^ _ 

X 

Since both sides of the inequality are block diagonal in X, this implies that 

=^ PaB = Y^Px f% < dx 2"^S.in(^A|B), 1^ ^ ^^ . 
X 

This implies the second inequality of (5.7). 

The first inequality of (5.8) is equivalent to the statement 

H'^^{A\XX'C)p > Hl^^{XA\X'C)p 

for a state p that is coherent classical between X and X' . The inequality 
then follows directly from Lemma A. 3 applied to the state f G S^{p) 
that maximizes H^^^(X A\X' C) p and Theorem 3.4, which establishes that 
p = lixx'T^xx' is a candidate for the maximization of H^-^^{A\X X' C) p. 
The second inequality of (5.8) is shown as follows. By the definition 
of the smooth max-entropy, there exists a state p G B'^ip) and a state 
a G S={'Kb) such that 

H'^,JA\B)p>2logF{p^B,tA0aB) 

> 2logF{pxAB, ;i-lxA ® (7b) > Hf^,^^{XA\B)p - logdx , 

Ux 

where a maximizes H^^^{XA\B)p and p minimizes H^^^{A\B) . D 

5.5.3 Conditioning on Classical Information 

We investigate the maximum amount of information a classical register 
X can contain about a quantum state A. (Various relations of this type 
have appeared in the literature, see, e.g. [RRIO, WTHRll]. The following 
lemma generalizes these results.) 
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Proposition 5.10. Let p G SkCKxab) be classical on Y and letO < e < 1. 
Then, 

Hl^^{A\B)p>Hl,^[A\BY),>Hl^XA\B)p-\ogd,- and (5.9) 

HL.AA\B), > H^,,iA\BY), > Hl,M\B)p - \ogdy. (5.10) 

Proof. The first inequalities of both (5.9) and (5.10) directly follow from 
data processing (cf. Theorem 5.7) applied to the TP-CPM try. 

Let p G B^{paby) be a state that achieves the maximal min-entropy 
for H'^^^{AY\B) p = H^^^{AY\B)p = A. In particular, there exists a state 
crG5<(:Ks) s.t. 

pABY < 2~^lAy (8) CTs = dy 2~^1^ (g) TTy (g) CTij . 

This implies that H^-^{A\BY)p > H^-^{AY\B)p - logdy The second 
inequality in (5.9) then directly follows from Proposition 5.9. 

To establish the second inequality in (5.7), we prove the dual property, 
i.e. for the purification Pabcyv that is coherent classical between Y and Y\ 
weshowthat/?^iJA|Cy')p < Hl^,^XMCYY')p+\ogdy. Let p G B^Pacy') 
and (7 G 5< (IK^y' ) be such that 



Pacy' < 2 ^Ia® Ocy', where A = Bl^,^{A\CY'\ 



p- 



Lemma A. 2 implies that any extension Pacyy' G ^^{p) satisfies 

pACYY' < dy Pacy' (^ ty < 2"^(iyly^ (g) UCY'- 

Hence, we find the following inequality 

H^^{AY\CY')p > Hl^{A\CY')p - \ogdy. 

An application of Lemma A. 3 to the min-entropy on the Ihs. establishes 

H^,XMCYY')f > Hl,,^{A\CY')p - logdy, where f = Iiyy,pliyy, 

is coherent classical between Y and Y' . Since p has the same property. 
Theorem 3.4 implies that f G B^{p), which concludes the proof. D 

5.5.4 Functions on Classical Registers 

Let us again consider the state Pax by of (5.3). On the one hand, applying 
a function f : Y ^ Z on the register y is a special case of data processing 
(cf. Theorem 5.7) and we immediately find 

H^^iA\BY)p < H^^^{A\BZ)p and Hl,^{A\BY)p < H^,^{A\BZ)p. 

On the other hand. Proposition 5.10 can be used to show that applying 
a function on the register X cannot increase the smooth entropies. (Note 
that for the min-entropy, using its interpretation as a guessing probability, 
this corresponds to the intuitive statement that it is always at least as hard 
to guess the input of a function than to guess its output.) 
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Proposition 5.11. Let Pxab = ^Px \x){x\ ® co^g with oj^^ G S<{'Kab) 
he classical on X. Furthermore, let Q < e < \ and let f : X ^ Z be a 
function. Then, the state Tzab = YIPx \f{x)){f{x)\z ® '^^ab satisfies 

Hl^^{ZA\B)r < Hl,,{XA\B)p and Hl,^{ZA\B)r < Hl,^{XA\B)p. 

Proof. A possible Stinespring dilation of / is given by the isometry U : 
|x)^ I—)- \x)^, (8) \f{x))^ followed by a partial trace over X' . Applying U on 
Pxab, we get 

TxZAB ■■= UPxAbU'^ = X]^^ \^){Ax' ® |/(x))(/(x)|y O UJ^B 

which is classical on X' and Z and an extension of Tzab- Hence, the 
invariance under isometries of the smooth entropies (cf. Proposition 5.3) 
in conjunction with Proposition 5.10 implies 

H'^^{XA\B)p = H^,^iXZA\B)r > Hl^^{ZA\B\. 

An equal argument for the smooth max-entropy concludes the proof. D 

5.6 Chain Rules 

The chain rule for the von Neumann (or Shannon) entropy is the relation 
H{AB\C) = H{A\BC) + H{B\C). To see that the von Neumann entropy 
satisfies this relation, simply substitute the definition of the conditional 
entropy, i.e. H{A\B) = H{AB) — H{B). For the smooth min- and max- 
entropies, this rule does not hold with equality. 

Instead, we give a collection of inequalities that replace the chain rule 
for the smooth min- and max-entropies. These were recently derived 
in [Vitll, VDTRll]. (See also [TSSRll, BCC+10] for preliminary resuhs.) 
These chain rules generalize some of the above results for classical regis- 
ters to general quantum systems; however, they introduce an additional, 
necessary smoothing parameter, e, and correction terms in log(2/e^) that 
do not appear in the results of the previous section. This is due to the fact 
that, previously, we gave bounds in terms of the dimension of the classi- 
cal register instead of its max-entropy. This simplifies the analysis and is 
sufficient in many applications involving classical information. 

The chain rules provided here should thus be considered complemen- 
tary the other results of this chapter. 

Theorem 5.12. Let e > 0, e' > 0, e" > and p £ S<{:Kabc)- Then, the 
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following chain rules hold 



1,^11 



2 



H'J:' "-' iAB\C)p > Hl^^{A\BC)p + Hl^XB\C)p - log -^, (5.11 



H^^:;+^^"{AB\C)p < Hi,M\BC), + hC,^{B\C), + log ^, (5.12) 

i/^f '+2-"(^|i3C7)p > H<,{AB\C)p - H^,^{B\C), - 21og |, (5.13) 

H^+/+^'"{A\BC), < Hi,^{AB\C)p - HC^{B\C)p + Slog |, (5.14) 

H^rl^''^^''\B\C)p > Hi^{AB\C)p - hC.M\BC)p - 31og |, (5.15) 

^mtf +''"(i?|C)p < Hi,^{AB\C)p - hC^{A\BC)p + 21og |. (5.16) 

For a proof, we refer to [Vitll]. Note that Eqs. (5.12), (5.14) and (5.16) fol- 
low by duality of the smooth entropies from Eqs. (5.11), (5.15) and (5.13), 
respectively. 
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Chapter 6 

The Quantum Asymptotic 
Equipartition Property 



The classical asymptotic equipartition property (AEP) is the statement 
that, in the limit of a large number of identical repetitions of a random 
experiment, the output sequence is virtually certain to come from the 
typical set, each member of which is almost equally likely. In this chapter, 
expanding on previous results in [TCR09], a fully quantum generalization 
of this property is shown, where both the output of the experiment and side 
information are quantum. We give an explicit bound on the convergence, 
which is independent of the dimensionality of the side information. 

6.1 Introduction and Related Work 

One of the pivotal results of classical information theory is the asymptotic 
equipartition property (AEP). This result justifies the use of the Shannon 
entropy to characterize many information theoretic tasks. It concerns a se- 
quence of independent and identically distributed (i.i.d.) random variables 
X"- = {Xi,X2,...,Xn) and states that [CT91] 

log Px" (x") -^ H{X)p in probability . 

Here, Px"{x^) = Px^^{xi,X2, . . . ,Xn) = Px{xi)Px{x2) ■ ■ -Pxixn) denotes 
the probability of the independent events x 1,3:2, ... ,x„. More precisely, 
the AEP states that, for any < e < 1, any // > and for large enough n, 
a randomly chosen sequence (xi, X2, . • . , Xn) is with probability more than 
1 — e in a typical set of sequences that satisfy 

H{x)-^l<--Px^{x'')<H{x) + ^l. (6.1) 

n 
This typical set of sequences of length n is denoted vl". 
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The AEP can be stated equivalently in terms of entropies. Recall the 
min- and max-entropies of a classical random variable X", which satisfy 

^min(^")p = ^oo(^")p = mm-logP^.(x") and 

^max(^")p = Hy,{X^), < HiaX - log P^. (x'^) . (6.2) 

The bounds in (6.1) can now be expressed in terms of the smooth min- 
and max-entropy. For this purpose, we consider the probability distribu- 
tion Q which restricts P to the typical set. It is formally defined as 

n f n. jPx4xn/c ifx"GA;^ 

Qx" [x ) = < , ^ , where c = > P^- [x ) . 

1° else x4lji 

The AEP now states that for large enough n, the fidelity between the 
distributions P and Q satisfies 



FiP,Qf = (^VPx<x-)Q^^{x-)) = Yl Px"(x")>l-e, 



2 



and, thus, the purified distance is upper bounded by ^/e. Finally, the AEP 
in (6.1), in conjunction with (6.2), implies that, for any 0<e<l, /z>0 
and large enough n, 

^i/^„(A")p > ^H^UX^Q > H{X), - ^^ and 
lHf,,{Xn, < Ih^,^{X^)q < H{X), + ^ . (6.3) 



n n 



Conversely, its entropic form (6.3) implies an AEP of the form (6.1). 
We only roughly sketch this argument here. First note that the fact that 
the smooth min-entropy converges to the von Neumann entropy implies the 
existence of a set A^ with the property max{P(x") : x" G A^} ^ 2~"'^^^^ 
and x" G A^ probability almost 1 . Similarly, the convergence of the max- 
entropy (together with the fact that the smooth max-entropy is related 
to smooth Renyi entropy of order 0) implies the existence of another set 
A"' with |^"-| Ri 2"-f^('^) and x" G A"' with probability almost 1. Since 
both these sets occur with probability almost 1, it is easy to see that their 
intersection, A^ n A^, constitutes a typical set. 

The entropic form of the AEP explains the crucial role of the von Neu- 
mann entropy to describe information theoretic tasks. While operational 
quantities in information theory (such as the amount of extractable ran- 
domness, the minimal length of compressed data and channel capacities) 
can naturally be expressed in terms of smooth entropies in the one-shot 
setting, the von Neumann entropy is recovered if we consider a large num- 
ber of independent repetitions of the task. 



6.1 Introduction and Related Work 
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Figure 6.1: Emergence of Typical Set. Plot of the surprisal, — logP(x"), 
over the cumulated probability of the events. We consider n independent 
Bernoulli trials with p = 0.2. The curves for n = {50,100,500,2500} 
converge to the typical set (indicated by the horizontal line). The min- 
entropy, H^^^{X"^), is the minimum of the corresponding curve, the max- 
entropy, H^^^{X'^), is upper bounded by its maximum. Moreover, the 
respective e-smooth entropies can be approximately obtained by cutting 
off a probability e from each side of the x-axis and taking the minima or 
maxima of the remaining curve. 
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Moreover, the entropic approach to asymptotic equipartition lends it- 
self to a generalization to the quantum setting. Note that the traditional 
approach, which considers the AEP as a statement about (conditional) 
probabilities, does not have a natural quantum generalization due to the 
fact that we do not know a suitable generalization of conditional probabil- 
ities to quantum side information. 

Here, we want to show that the smooth conditional min- and max- 
entropies converge to the von Neumann entropy in the i.i.d. limit. Recall 
that the conditional von Neumann entropy of a state p is defined as 

H{A\B)p = H{p^s)-H{ps), where H{p) = -tr{plogp). (6.4) 

This convergence can be shown in several ways, for example through the 
use of chain rules for smooth entropies [BRll], which reduce the problem to 
the classical AEP, or through the use of typical subspaces (cf. e.g. [NCOO]). 
While these techniques achieve the desired asymptotic limit, they fail to 
give good bounds on the convergence for finite n. More precisely, we 
are interested in the difference between the smooth entropies and the von 
Neumann entropy for finite n. This distance is in general a function of n, 
e and some properties of the quantum state under consideration. 

We call such a relation a fully quantum AEP because both the A and 
B systems are general quantum systems. 

For the smooth conditional min- and max-entropy, the convergence for 
finite n was first analyzed by Holenstein and Renner [HRll] for classical 
probability distributions. Renner also generalized these arguments to the 
quantum setting [Rcn05]. He shows that^, for any e > and for tensor 
product states pfg = Pab ® Pab tX" . . . Pab, it holds that 

-F^i,(A"|i?")p > H{A\B), - ^, (6.5) 



where 6 scales with the product of Hq{A)p and Y^log(l/e).^ 

On a technical level, our results are related to recent findings in quan- 
tum hypothesis testing [AMAV07, ANSV08, NOOO]. 

6.1.1 Main Contributions 

This chapter is based on [TCR09]; however, the results presented here are 
strictly more general as we extended certain proofs to relative entropies and 
a more general class of operators. This allows us to plug the inequalities 
into other arguments, where (e.g. smoothed) states are not necessarily 
normalized. 



^Note that the smoothing of the min-entropy was defined difi^erently in [Rcn05]. 
^Note that Hq{A)p = dA if the state p has full support on Ka; thus, this bound 
depends on the dimension of the A system. 
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The main result of this chapter establishes that 6 in (6.5) scales with 
H^^^{A\B) and H^^-^{A\R) instead of Hq{A), where i? is a reference sys- 
tem purifying /O^s. These conditional entropies measure the correlations 
between the subsystems A, B and R and are often much smaller than 
Hq{A). In particular, conditional entropies do not depend on the Hilbert 
space dimension of any subsystem, which is useful in the context of quan- 
tum cryptography, where these dimensions are generally unknown.^ 

Result 5. For any e > and n large enough, we have 

-F^i„(A"|5") > H{A\B) - ^%^ and 
n vn 



^Hl,^{A-\B-)<H{A\B) + ^-^., 



where 6{e,v) = 41ogUA/log(2/e2) andv = \/2^max(^|B) + V2^max(^|R) +1. 

Together with converse bounds, this implies the fully quantum asymp- 
totic equipartition property. 

Result 6. The smooth entropies converge to the von Neumann entropy in 
the asymptotic limit of many copies. For any < e < 1, we have 

lim -H^^^{A^\B-) = hm -H^^,^{A^\B-) = H{A\B) . 

n— >oo n n— >oo n 

Note, in particular, that our statement holds for any < e < 1. This 
improves on to the preliminary results in [TCR09], where the converse 
bound was only shown in the limit e — )• 0. Additionally, we improve many 
bounds for the case where the smoothing parameter is close to 1. This 
extension is important to show strong converse statements, as we will see 
in Section 8.1. 

6.1.2 Outline 

The remainder of the chapter is organized as follows. In Section 6.2, we 
explore the special case of classical registers without side information and 
introduce the proof techniques, based on relations between the smooth min- 
entropy and Renyi entropies, that will also lead to the fully quantum AEP. 



^Note also that the appearance of the reference system in the convergence rate is 
not entirely unexpected. This can be seen from the fact that (6.5) directly imphes a 
bound on the max-entropy, i.e. 

^HLUA"\E:') < H{A\R) + ^, 
n ^n 

by duality of the entropies. Hence, it is no surprise that this duality also appears in the 
convergence rate. 
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In Section 6.3, we introduce quantum generalizations of the classical Renyi 
a-entropies. (See also Appendix B, where some of their properties are 
discussed.) Then, in Section 6.4, we show, analogous to the classical proof, 
that the smooth min-entropy can be bounded in terms of these entropies. 
Section 6.5 then bounds the difference between Renyi a-entorpies and the 
von Neumann entropies for a. close to 1 . Finally, these results are combined 
in Section 6.6 to prove the fully quantum AEP. 

6.2 Sketch of a Classical Proof 

The relation between the traditional formulation of the AEP in terms of 
probabilities [CT91] and its entropic formulation was already explored in 
the preceding section. To give an intuitive idea of the techniques used in 
the following, we first prove a special case of the AEP for classical registers 
and take an additional limit e — )• 0. More precisely, we show that 

hm hm -i/^i„(X") = }i[X) . 

We employ the Renyi a-entropies [Rcn61] 

i/,(X):=— ^logVP(xr, aE(0,l)U(l,oo), (6.6) 

1 — Q ^-^ 

for which Hoc {a — )• oo), Hq {a — )• 0) and the Shannon entropy (a — )• 1) 
are defined as limits. Furthermore, the entropies Ha are monotonically 
decreasing in a and, as shown in [RW04], the Renyi entropies with a > 1 
are close to the smooth min-entropy in the sense that 

i/^i„(A) > HaiX) - -^ log ^, a > 1 

a — 1 E 

while those with a < 1 are close to the smooth max-entropy. Note that 
the error term ^— j-logl/e in (6.7) diverges when we try to recover the 
Shannon entropy. However, in the case of an i.i.d. sequence we find 

-i/^i„(X") > Ha{X) - ^^ log - (6.7) 

n n[a — 1) e 

where we have used H^iX'^) = nH^iX). We proceed by bounding the 
entropy -H^^^{X^) in the limit n — t- oo,e — t- from above and below. To 
get the lower bound, we choose a = 1 + l/\/n and take the limit n — t- oo 
in (6.7). An upper bound in this limit follows directly from the fact that 
H^^^j^{X) < H{X) (cf. Proposition 4.3 or [Rcn61]) and the continuity of 
the Shannon entropy.^ 

A similar argument shows that the smooth max-entropy converges to 
the von Neumann entropy in the asymptotic limit. 

^See, e.g. Fannes [Fan73, AF04], where the relevant continuity property is shown for 
(conditional) von Neumann entropies. 
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6.3 Quantum Relative Entropies 

We prove the asymptotic equipartition property for relative entropies, 
which are introduced here. Conditional entropies can be seen as special 
cases of relative entropies. 

6.3.1 Quasi-Entropies 

A very general class of classical relative entropies are the /-divergences, 
originally introduced by Csiszar [Csi72]. They have been generalized to 
the quantum setting by Petz [Pet84], who calls them quasi-entropies. (See 
also [OP93, PetlO, HMPBll] for an overview of recent results on these 
entropies.) The crucial observation is that some of the most interesting 
mathematical properties of the von Neumann and Shannon entropies are 
a direct consequence of the (operator) concavity of the function h : t >-^ 
—t\ogt that defines the functional H{p) = tr(/i(/9)). Hence, the following 
generalization of this functional was investigated. 

Definition 6.1 (Quasi-Entropy). Let A,B e P(:K) and let f : R^ ^ R 
be continuous. Then, the f- quasi- entropy of A relative to B is 

Sf{A II B) := lim (T\,jB + itf[{B + ^1)-^ ® vl^) yj B + ^t 

where \T) = ^^ |^)'^|^) o.nd {|i)} is an orthonormal basis ofK, with respect 
to which the transpose is defined. 

Unconditional quasi-entropies are recovered when we substitute B = 1. 
Let us consider a few examples of such functions. For this purpose, let 
A,Bg VCK) be two positive operators whose supports satisfy supp {A} C 
suppjB}, such that the limit in Def. 6.1 is always finite and B^^ can be 
interpreted as a generalized inverse. First, using h : t ^ —tlogt extended 
to Mq" using /i(0) = limt_>o h{t) = 0, the relative von Neumann entropy is 
given as 

SiA\\B) := ShiA\\B) = tr {AilogB - log A)) . (6.8) 

Note that we omit a minus sign here that is present in the conventional 
definition of the relative entropy. To derive this well-known expression from 
Def. 6.1, we used that {T\X l\r) = tr(X) and {X (E> 1)\T) = {1(E> X'^)\T) 
for any operator X G C{'K). 

6.3.2 Relative Renyi entropies 

As another example, we recover generalizations of the Renyi entropies of 
order a € (0, 1) U (1, oo). This is achieved using ga : t ^^ t" and 

So.{A\\B) ~ ^\ogSgM\\B) = -^logtr(A-i?i-"), (6.9) 

1 — a \ — a 
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where, for a > 1, we use the generahzed inverse of a. We may continuously 
extend the range of a to the limits a — )• and a — t- oo. Moreover, the von 
Neumann relative entropy emerges in the limit a — )• 1 if tr(A) = 1. Hence, 
we can continuously extend the range of valid parameters to a S Mq" by 
setting 5i = S* in this case. Some properties of these entropies are discussed 
in Appendix B. 

6.3.3 Relative Min- and Max-Entropy 

In addition to this, we will also need a relative entropy version of the 
min- and max-entropies. The relative min-entropy was introduced by 
Datta [Dat09]^ and is directly related to the conditional min-entropy de- 
fined in [Rcn05]. The relative min- and max-entropies of A G V{'K) relative 
to -B e V{'K) are given by 

Srnin{A \\B) := sup {A : ^ < 2"^S} and (6.10) 

c / /I II D\ 1 — , IL n\^ /dI|2 



S^UA\\B) ■.= \og\\^VB 



\r 



We also need a smoothed version of the relative min-entropy, which we 
define for any p G S<{'K) and a G V{'K). 

S^rain{p\W) '■= ^^X 5min(/5 Ho") . (6.11) 

Clearly, the (smooth) min- and the max-entropy (as defined in Chap- 
ter 4) of A conditioned on 5, H^^^{A\B)p and H^^y^{A\B)p, can be recov- 
ered by the substitution A = p^s and i? = l^i (8) <7s , where Us is maximized 
over 5=(^fl). 

6.4 Lower Bounds on the Smooth Min-Entropy 

The following lemma gives a first lower bound on the smooth relative 
entropy (6.11). (A similar, less general lemma can be found in [DR09].) 

Lemma 6.1. Let p £ S<:{'K),a G V^K) and A > «S'min(p ||o"). Then, 
SLiniP Ik) > -^ > where e = V2tr(A) - tr(A)2 and A = {p - 2-^a}+. 

Proof. We first choose p, bound S^^^{p \\a), and then show that p G B^{p). 
We use the abbreviated notation A := 2~ a and set 



P 



:=GpG\ where G := A5(A + A)"? , 



where we use the generalized inverse. From the definition of A, we have 
p < A -|- A; hence, p < A and 5'min(p \W) > A. 



^There, it appears under the name Du 
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Let IV') be a purification of p, then (G i^ t)\tp) is a purification of p 
and, using Uhlmann's theorem (3.3), we find a bound on the (generahzed) 
fidelity: 



Fip,p) > MCm + V(l-tr(p))(l-tr(p)) 

> K{ tr(Gp)} + 1 - tr(p) = 1 - tr ((1 - G)p) , 

where we introduced G := 2(G' + G'). This can be simplified further after 
we note that G is a contraction.^ To see this, we multiply A < A + A with 
(A + A)~2 from left and right to get 

GtG= (A + A)-5A(A + A)-5 < 1. (6.12) 

Furthermore, G < 1, since ||G|| < 1 by the triangle inequality and ||G|| = 
||Gt|| < 1. Moreover, 

tr ((1 - G)p) < tr(A + A) - tr (G(A + A)) 

= tr(A + A) - tr((A + Ay/^A'/^) < tr(A) , 

where we used p < A + A and \^A + A > vA- The latter inequality fol- 
lows from the operator monotonicity of the square root function (cf. Sec- 
tion 2.3.2). Finally, using the above bounds, the purified distance between 
p and p is bounded by 



Pip,p) = ^1-F^{p,p)) < ^1 - (1 - tr(A))' = V2tr(A) - tr(A)2 . 
Hence, we verified that p £ B^{p), which concludes the proof. D 

In particular, this means that for a fixed e £ [0,1) and suppjp} C 
suppjo"}, we can always find a finite A s.t. Lemma 6.1 holds. To see this, 
note that e(A) = Y^2tr(A) — tr(A)2 is continuous in A with e{Sinm{p W^)) = 
and limA-xx) e(A) = 1. 

Renyi Entropies and Smooth Min-Entropy 

Our main tool for proving the fully quantum AEP is a family of inequalities 
that relate the smooth relative min-entropy to relative Renyi entropies for 
a G (1,2]. This family contains the von Neumann relative entropy in the 
limit a — )■ 1. This can be seen as a quantum generalization of the classical 
inequality in [RW04]. 

Proposition 6.2. Let p G S<{:K),a G V{:K), < e < 1 and a e [1,2]. 
Then, 

SminiP Ik) > Sa{p lk) ", where g{e) = log , (6.13) 



a- 1 1 - VI 



e 



2 



^A contraction G is an operator with operator norm ||Gj| < 1. 
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Proof. We consider two cases: (1) The a entropy diverges (to — oo) and the 
inequahty holds trivially. (2) We have suppjp} C supp {cr}. In this case, 
we can find an isometry !K' — )• ^ that maps a a' to a and p' to p s.t. a' has 
full support. The niin- and a-entropies are invariant under this isometry 
due to Proposition 5.3 and Lemma B.l, thus, we henceforth assume that 
a is invertible in this proof. 

We use Lemma 6.1 to get a first bound on -0^;^; in particular, we 
choose A s.t. Lemma 6.1 holds for e. Next, we introduce the operator X := 
p — 2~^a with eigenbasis {lei)}^^^. The set S^ C S contains the indices i 
corresponding to positive eigenvalues of X. Hence, P~^ := Ylii^s+ l^«X^«l i^ 
the projector on the positive eigenspace of X and P^XP^ = A as defined 
in Lemma 6.1. Furthermore, let ri := (ej|p|ej) > and Sj := {ei\a\ei) > 0. 
It follows that 

Vi G 5"+ : ri- 2~^Si > and, thus, —2^ > 1. 

Si 



For any a G (1, 2], we bound tr(A) = 1 — Vl — ^^ as follows: 
1 - Vl-e^ = tr(A) = J^ r, - I'^Si < ^ Vi 



ies+ i€S+ 

a-l 






ies+ ^ - ^ i(zs 

Hence, taking the logarithm and dividing by a — 1 > 0, we get 

Next, we use the monotonicity of the Renyi entropies (cf. Lemma B.4). 
We use the measurement TP-CPM M : X i-> SieS \^i){^i\^\^i){^i\ to ob- 
tain 

S^{p\\a) < S^{U{p)\\M{a)) = ^ log J^rf s^". 

We conclude the proof by substituting this into (6.14) and applying the 
upper bound on A in Lemma 6.1. D 

We also note here that the term 1 — \Jl — e^ in g{£) can be bounded by 
simpler expressions (cf. Fig. 6.2). We find 1 — \/l — e^ > ^ using a second 
order Taylor expansion of the expression around e = and the fact that 
the third derivative is non-negative. This is a very good approximation for 
small £. Hence, Eq. (6.13) can be simplified to 

1 2 

'S'min(P Ik) > Sa{p \\(t) - -^—^ log ^ . (6.15) 

This form of the inequality been reported previously [TCR09] . 
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Figure 6.2: Bound on g{e) . Plot of the function g{e) = — log (l — Vl — £^) 
and its upper bound, log \. The horizontal axis denotes e. 



Conditional Entropies 

This lemma is of particular interest when applied to the smooth conditional 
min-entropy. In this case, let p S S<_{^ab) and a be of the form 1^ ® Pb- 
Then, for any a S (1, 2], we have 



> SaipAB^A® Pb) 



'^'^ -HM\B),-^. (6.16) 



a — 1 



a — 1 



The duality relation for the smooth min- and max-entropies (cf. Theo- 
rem 5.4) and the Renyi a-Entropies (cf. Lemma B.5) imply a corresponding 
dual relation for the max-entropy. For any a G [0, 1), we have 



Hf,,,{A\B), < H^{A\B) 



9{e) _ 
1 — a 



(6.17) 



6.5 From Renyi to von Neumann Entropies 

We will use Proposition 6.2 to get a lower bound on the smooth min- 
entropy in terms of a-entropies, hence, it remains to find a lower bound 
on the a-entropies in terms of the von Neumann entropy. In turn, the 
bound on the convergence will depend on the smoothing parameter e and 
a contribution T(p ||(t) that describes how fast the a-entropies converge to 
the von Neumann entropy. 

Definition 6.2. Let p,a €z VCK), then we define the a-entropy conver- 
gence parameter, 

T(p ||cr) := 2"5'^3/2(''ll'^) + 2^'^V2(/'II'^) + 1 . 
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We can now state a bound on the a-entropies for a close to 1. 

Lemma 6.3. Let p £ S={'K),a £ VCX) and 1 < a < 1 + ^^, where 
V = T(p \\a). Then, 

S^{p\\cj) > S{p\\a) - 4(a - l)(logt;)'. 

Proof. We assume that a is invertible in this proof. The general result 
then follows by the arguments outlined at the beginning of the proof of 
Proposition 6.2. 

Let {\i)} be an orthonormal basis of "K. The state I7) := X^i K) "^ N) 
is the (unnormalized) fully entangled state on 5{ C?) 5C. We introduce a 
purification \ip) := ^ I7) of p. To simplify notation, we use ^5 := a — 1 as 
weh as X := p(g) {a-^Y . 

Let us first approximate Sa for small /3 > 0. 

So.{p Ik) = -i log ((^ix/^iv.) > ^ (1 - {^\^^W)) , 

where we used Inx < a; — 1 for all x > 0. We now expand the exponential 
t^ for each eigenvalue t > of X as follows: v^ = 1 + /3 In t + r^(t), where 
rfi{t) := t/^ - /?lnt - 1. This leads to 

S.{p Ik) > ^ (1 - tr(p) - /3(v'|lnX|v.) - {^\r f,{X)\^)) 

■>S{p\\a)-^^{^\Tf,{X)\^). (6.18) 

To simplify this further, we note that 

r^it) < 2(cosh(/31nt) - 1) =: Sf}{t) . 

It is easy to verify that s^ is monotonically increasing for t > 1 and concave 
in t for P < 1/2 and t e [3, 00). Furthermore, we have S/3(t) = S/3(j) and 
■5/9 (*^) = S2/3{t). We use this to bound^ 

s^{t)<s^(t + -^+2^ =S2is(^Vi+-^^ <S2^(yi+-^ + iy (6.19) 

Next, we apply (6.19) to the matrix element in (6.18) and use the fact 
that the operator VX + 1/vX + 1 has its eigenvalues in [3, 00) and 2(3 < 
2wl; — 5 together with Lemma 2.7 to get 

{ip\sp{XM < {^\s2p(^Vx + -^ + a) \ip) < S2i3{v) , (6.20) 



^Adaptions of this step lead to different bounds. Here, we are interested in a bound 
that can be expressed in terms of S1/2 and S3/2 ■ 
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where we substituted v = {^\vX + 1/vX + t\ip). 

Taylor's theorem and an expansion around (3 = gives an upper bound 
on Sf}{t): s/3(t) < ^2(lnt)2cosh(/31nf). Hence, 

— *— S2/3(i;) < 4/3(logt;)2ln2cosh(2/31ni;) < Al3(logvf , (6.21) 

pin 2 

where we simplified the expression (for convenience of exposition) using 
ln2cosh(ln3/2) < 1. The lemma now follows after we substitute (6.21) 
and (6.20) into (6.18). D 

This Lemma can be extended to include sub- normalized states, p G 
5<(;K). Let p = p/ tr(/j) and v = T(p \\a), then 

SaiP Ik) = Saip Ik) H log ; 



a — 1 tr(/9) 

> Sip\\a) + ^log-i^ -4(a- l)(logi))2 
a — 1 tr[p) 

^ .S(p|k) + ^^log-^-4(a-l)(log{))2. 



tr(p) a — 1 tr(/)) 

Now we combine Proposition 6.2 and 6.3 to get the desired bound for 
i.i.d. operators. (Note that we restrict p to normalized states because, if 
tr(p) < 1 then the trace of p*^" drops exponentially to zero. An extension 
of this to sub-normalized states, while possible, thus seems uninteresting.) 

Theorem 6.4. Let p G S={Ji),a G V{:K), < e < 1 and v = T{p\\a). 
Then, for any n > lg(e), the i.d.d. operators p®^ and o"®" satisfy 

-'5^in(P®"lk®")>^(plk)-%^ where 6{e,v) = Alogv^/^) 
n \ n 



an 



d g{e) = - log (l - Vl - £2) . 



Proof. We use Proposition 6.2, the additivity property of the Renyi en- 
tropy (B.3) and Lemma 6.3 to get a bound on the smooth min-entropy. 
Let a := 1 + ^ — y= for a parameter p (to be optimized over), then 

-^min(P®"lk^") > -Saip^'^a®! - -r^.aie) 
n n n[a — 1) 

= Sa{p\\a) ^g{e) 

Jn 



> Sip Ik) - -^(pg{e) + -(logt;)2) . (6.22) 



fn\ p 

Clearly, we want to choose p such that it minimizes the expression pg{e) + 

log 3 
41ogD 



p ^(logu)^. However, the requirement a < l+jw^ in Lemma 6.3 restricts 
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the choice of // for any fixed n, hence, the error term is in general also a 
function of n. Nonetheless, for large enough n the optimum, ^u*, can be 
reached^ and we get 



'(log^ for n>'J^ . 

5(e) 5 n^ 5^ 



\x^ = \l ^^^^ for n>- — -^ — = -g[e) . (6.23) 



Substitution of this expression into (6.22) concludes the proof. D 

6.6 The Asymptotic Equipartition Property 

6.6.1 Direct Part 

In this section, we are mostly interested in the application of Theorem 6.4 
to conditional min- and max-entropies. Here, for any state p G S={'Kab), 
we chose cjas = H.a'^ Pb and apply Theorem 6.4. This implies that 

n n ^ 

> b[pAB,(yAB) j^- = H{A\B)p 



This (and the dual of this relation) leads to the following corollary. 

Corollary 6.5. Let p G S={'Kab) O'^d < e < 1. Then, the smooth 
entropies of the i.i.d. product state Pa"b" = Pab satisfy 

-H'^,,{A^\B^)p>H{A\B),-^-^ and (6.24) 



ii/^,,(A"|i?"), < H{A\B), + ^^ , (6.25) 



where 6{e, u) is defined in Theorem 6.4 and v = T{pab H^a ® Ps). 

We may now use (B.2) and its dual relation, H^,^{A\B)p > H^^^{A\B)p 
(cf. Theorem 5.4 and Lemma B.5), to bound 



This is Result 5 of Section 6.1. 

The following is a trivial corollary from Theorem 6.4 and the above 
arguments, in particular (6.24) and its dual relation (6.25). 

Corollary 6.6 (AEP, direct). Let p e 5=(!K^fl) andO <e <l. Then, 
hm -H^^^iA-\B-)p>H{A\B)p> lim -//^,,(A«|i?"), . 

n— >-oo n n— >oo n 



^To verify this, evaluate an upper bound to a = 1 + (2/i,y'ri) ^ using the expression 
for n in (6.23) and note that ^5/2 < log 3. 
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6.6.2 Converse Part 

To prove asymptotic convergence, we will also need converse bounds. For 
e = 0, the converse bounds are given by Proposition 4.3, i.e. H^^^{A\B) < 
H{A\B) < H^g^^{A\B). For e > 0, similar bounds can be derived from 
the continuity of the conditional entropy in the state [AF04]. However, 
such bounds do not allow a statement of the form of Corollary 6.6 as the 
deviation from the von Neumann entropy scales as n/(e), where /(e) — t- 
only for e — )■ 0. (See, for example, [TCR09] for such a weak converse 
bound.) This is not sufficient for some applications of the asymptotic 
equipartition property. 

Here, we prove a tighter bound, which relies on the bound between 
smooth max-entropy and smooth min-entropy established in Proposition 5.5. 
Applying this Proposition in conjunction with Eqs. (6.24) and (6.25) es- 
tablishes the converse AEP bounds. Let < e < 1. Then, using any 
smoothing parameter < e' < 1 — e, we bound 



1_ . 1„.' 1. 1 

n 

ilog I ^'-^^^ 

n 1 — {e + e'Y y/n 



-Hl,^{A\B), < -Hi^^{A\B)p + - log ^_^^^^,y 

< H{A\B), + i log ^ _ ' ^., + "-^^. (6.26) 



The corresponding statement for the smooth max-entropy follows by the 
dual argument. We thus find 

Corollary 6.7 (AEP, converse). Let p e S={^ab) and < e < 1. Then, 

hm -H^^.JA^\Bnp<H{A\B),< lim i/f^,,(A«|i?"), . 

These converse bounds are particularly important to bound the smooth 
entropies for large smoothing parameters. In this form, the AEP implies 
strong converse statements for many information theoretic tasks that can 
be characterized by smooth entropies in the one-shot setting (see, for ex- 
ample. Chapter 8). 
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Chapter 7 

Uncertainty Relations for 
Smooth Entropies 



Entropic uncertainty relations (UCRs) provide lower bounds on the un- 
certainty of the outcomes of two incompatible measurements given side 
information in terms of conditional entropies. In this chapter, which is 
based on [TRll] and [HTll], we prove several uncertainty relations using 
smooth min- and max- entropies as well as the von Neumann entropy as 
measures of uncertainty. 

7.1 Introduction and Related Work 

Uncertainty relations have inspired physicists since the early days of quan- 
tum mechanics, when Heisenberg [Hei27] formulated his famous uncer- 
tainty principle. In its Robertson [Rob29] form, it states that the product 
of the standard deviations of the outcomes of two incompatible measure- 
ments on a pure state {ip) is lower bounded in terms of the commutator of 
the observables {X and Y) of these measurements. 



1 

0"x • <7y > - 



'M[x,yU) 



(7.1) 



2 _ /J,\72l 



Here, the variance of the measurement outcomes is given as o"| = {ip\Z 
{ip\Z\'ip)'^ where Z is either X or Y. The commutator, [X, Y] = XY — YX, 
quantifies the incompatibility of the two observables. 

Note that the uncertainty relation becomes trivial if the measured state 
is an eigenstate of either X or Y. More generally, the lower bound in (7.1) 
depends on the state {ip) before measurement, which is often undesirable. 
For example, the state might be unknown or, in a cryptographic setting, 
prepared by an adversary. Furthermore, the standard deviation is not our 
preferred measure of uncertainty as it conflates two concepts: the value 
associated with different measurement outcomes and the uncertainty in 
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the probability distribution of the outcomes. The latter is the uncertainty 
we are interested in and is often quantified by entropies. 

Uncertainty relations in terms of entropies were first proposed by Hirsch- 
man [Hir57], and Deutsch [Deu83] for the case of a finite output alphabet.^ 
In the form proposed by Maassen and Uffink [MU88], it states that 

1 I i2 

H{X)+H{Y) > log- , where c = max(x|y) . (7.2) 

C x,y ' ' 

for any state p before measurement. Two registers, X and Y, store the 
respective outcome of two different projective measurements, X and 3^. 
The overlap, c, is determined by these measurements and independent 
of the state before measurement. More specifically, the maximization in 
the definition of the overlap is taken over all eigenvectors \x) oi X and 
\y) of Y, where X and Y are observables corresponding to the projective 
measurements X and 3^. 

More generally, Krishna and Parthasarathy [KP02] considered POVMs, 
given by sets X = {Mx} and y = {Ny} of positive semi-definite opera- 
tors. For such measurements, the states of the registers containing the 
measurement outcomes are given by 



Px = ^i^ {^/M\x P^/M~^\x){x\ and py = ^tv i^j p^^\y){y\ 



The uncertainty relation (7.2) now holds for these states and the overlap 

2 



c := max 

x,y 



(7.3) 



Note that the overlap reduces to (7.2) if the measurements are projective. 
The uncertainty relation can be extended to one with classical side in- 
formation (see also [Hal95, CBKG02]). Let the state between the system to 
be measured. A, and an observer, O, be of CQ form Pao = ^oPo |o)(o|<8'/0° . 
This can be seen as the observer preparing the state /o^ with probability po- 
What is the entropy of the observer O about the measurement outcomes 
X and Yl It is easy to see that 

H{X\0) + H{Y\0) = Y,Po [H{X)po + H{Y)po^ ^ ^^S ^ • 

o 

Thus, the uncertainty relation (7.2) still holds. Furthermore, we may model 
the random basis choice by a uniformly distributed bit Q ^ {X , 3^} that 
is independent of the state Pao- Hence, we consider a joint state Paob = 
Pao ® '^e and a measurement on the A system that depends on the classical 



^Sce also [WWIO], which offers a comprehensive review of uncertainty relations. 
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register Q} The measurement outcome is denoted Z, which replaces X 
and Y. Thus, 

H{Z\Oe) = Ih{X\0) + Ih{Y\0) > ^ log ^ . (7.4) 

This uncertainty relation with classical side information is also visualized 
in Figure 7.1. 

A statement of the uncertainty relation in this form naturally leads to 
the question of what happens if the observer is allowed to store quantum 
information about the system, i.e. if the state Pao is entangled. For the 
case of (7.4), only the trivial bound H{Z\OQ)p > holds. To see this, 
consider a fully entangled state between A and O. In this case, the observer 
can predict the value of Z perfectly using an appropriate measurement 
(depending on 0) on his system. This is explained in Figure 7.2. 

However, due to the monogamy of entanglement^, it is unclear what 
happens if we introduce a second observer. Note, for example, that the 
entropy H{A\0) before measurement can be negative for entangled states 
while the sum H(A\Oi) + H{A\02) is always non-negative. 

In fact, Renes and Boileau [RB09] (see also [CW05]) conjectured the fol- 
lowing uncertainty relation, which was later shown by Berta et al. [BCC"'"10]. 
For a tripartite quantum state shared between the system A which is mea- 
sured in the basis determined by G and two observers, Oi and O2, it holds 
that 

H{Z\Oie)+H{Z\02Q)>log-, (7.5) 

c 

with c defined as in (7.3) but only for rank-1 projective measurements. This 
proof was later extended to POVMs and simplified by Coles et al. [CYGGll, 
CYZll]. Concurrently, our results also imply a simplified proof of the un- 
certainty relation for von Neumann entropies and POVMs. 

7.1.1 Main Contributions 

We have already seen in Chapter 5 that the smooth entropies, for any 
tripartite state Pabc and e > 0, satisfy the duality relation 

^min(^l^)p + ^max(^|C^)p>0. 

Our result now shows that this lower bound increases (as is the case for 
von Neumann entropies) when we apply incompatible measurements on 
the A system. This extends the entropic uncertainty relation of (7.5) to 
smooth entropies. 



^This can be seen a measurement of the joint system consisting of A and O. 
^Or, depending on the reader's preference, due to no-cloning. 
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eG{+, x} 

uniform 




(a) A quantum system in the state p is measured in one out of two bases, determined 
by a uniformly random bit, O. Then the basis choice Q is sent to an observer, O, 
who is asked to determine the measurement outcome, X. The observer is allowed 
to use classical information about the state p before measurement, including a full 
characterization of the density matrix. One may alternatively think of the observer 
preparing the state p before the game. The uncertainty relation (7.4) ensures that 
the entropy the observer has about the measurement outcome satisfies the lower 
bound H{X\OQ) > | log i. Hence, if the overlap is nontrivial, the observer cannot 
predict X with certainty. 

Given B, what is X? 



eG{+, x} 

uniform 




(b) This game can be trivially extended to two observers, Oi and O2. In this case, 
the sum of their entropies satisfies H{X\OiQ) + H{X\02Q) > log-. This scenario 
is mainly interesting in the quantum case. 

Figure 7.1: Uncertainty Relations with Classical Side Information. 
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e6{+, x} 

uniform 





(a) A quantum system, A, is measured in one out of two bases, determined by 
a uniformly random bit, O. Then the basis choice Q is sent to an observer, O, 
who is asked to determine the measurement outcome, X. The observer holds a 
quantum system which might be entangled with the measured system. One may 
alternatively think of the observer preparing the bipartite state pAO , of which the 
A part is measured and the O part is stored in a quantum memory. For this setup, 
no uncertainty relation of the type (7.4) exists. As a counter-example, consider the 
case where pAO is fully entangled. In this case, the observer can always choose a 
measurement (depending on 0) whose outcome is perfectly correlated with X. For 
example, if the state is a singlet, measuring both the A and O parts in the same, 
arbitrary basis will lead to perfectly anti-correlated binary variables. 

Given B, what is X? 



e6{+, x} 
uniform 




Oi 



O2 



(b) However, if we instead consider two observers, Oi and O2, an uncertainty rela- 
tion is possible. Note, in particular, that the counterexample provided for the case of 
one observer fails due to the monogamy of entanglement. As in the classical case, the 
sum of the entropies of the two observers satisfies H(X\OiO) + H {X\020) > log i. 
However, in the quantum case it is possible that one of these entropies is zero, which 
implies that the other entropy is large. This trade-off between the two entropies 
does not exist in the classical case and can be seen as an effect of no-cloning — it is 
generally not possible that the two observers hold copies of each other's quantum 
information. 

Figure 7.2: Uncertainty Relations with Quantum Side Information. 
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Result 7. For e > 0, any tripartite state Pabc o-s well as two POVMs 
{Mx} and {Ny} on A, it holds that 

H^^^{X\B) + H^^^(Y\C) > log -, where c = max 

c x,y 

This uncertainty relation has been used to prove security in quantum 
cryptography. In [TLGRll], it was used to derive tighter key rates for fi- 
nite block lengths in quantum key distribution. In [LSSll], Le et. al. prove 
security of a reference frame independent quantum key distribution proto- 
col using the above uncertainty relation.^ Furthermore, the result has been 
shown in the framework of general von Neumann algebras in [BFSll], indi- 
cating that the UCR for min- and max-entropies is a fundamental property 
of quantum physics and not a relict of the density operator formalism. 

Furthermore, we explore a tighter version of the uncertainty relation 
with a lower bound in terms of an effective overlap. The effective over- 
lap — in contrast to the overlap — is a function of the marginal state prior 
to measurement as well as the two measurements. A preliminary ver- 
sion of this result appeared in [HTll], where the relation is shown for 
von Neumann entropies. We extend these results here and show that 
a generalized uncertainty relation also holds for smooth min- and max- 
entropies, enabling its substitution into existing cryptographic security 
proofs [TLGRll, LPT+11]. 

Result 8. For e > 0, any tripartite state Pabc o,s well as two POVMs 
X = {Mj} and y = {Ny} on A, it holds that 



1 , 2 



^mln(^l^) + Hl,Ay\C) > log ^ - log ^2 ' 

where c* = ^^ tr(P'^/9^)cfc. Here, {P^} is any projective measurement 
that commutes with both X and y and Ck is the overlap of X and y on the 
subspace P^ . 

7.1.2 Outline 

The remainder of this chapter is structured as follows. In Section 7.2, we 
introduce the notion of overlap and show the uncertainty relation for the 
min- and max-entropy. The proof is very instructive and will also guide the 
proof of the generalized uncertainty relation, which is given in Section 7.3. 
There, we will also formally define the effective overlap. In Section 7.4, 
we discuss a variety of corollaries of the generalized uncertainty relation. 
We consider an application of the uncertainty relation to quantum key 
distribution. Finally, in Section 7.5, we consider a bipartite uncertainty 



^Applications of the uncertainty relation to quantum cryptography will also be the 
topic of Section 8.3 of the next chapter. 
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relation which might have applications in cryptographic settings where 
only two parties are involved. 



7.2 Traditional Formulation 

7.2.1 Overlap 

Let p E S<{'Kabc) be an arbitrary, tripartite quantum state. We want to 
bound the entropy about the result of a measurement on the A subsystem 
given side information stored in either B or C. Without loss of generality, 
such a measurement on the A system can be described by a POVM.^ 
Here we consider two POVMs, X = {M^} and y = {Ny}, on 'Ka. They 
measure the state on the A subsystem and store the measurement outcome 
in classical registers X and Y , respectively. This two registers can be 
seen as classical random variables that are correlated with quantum side 
information on the B and C systems. Note that, in our analysis here, we 
are not concerned with the state of the A system after the measurement. 
The relevant post-measurement states are thus given as 

PxBc = ^ \x){x\ ® ti a{\/MxPabc\/Mx) and (7.6) 

X 
PYBC = Y, \y)(y\ ^ ^^A{y%pABCy^) ■ (7.7) 

y 
We also define the overlap of these two measurements. 

Definition 7.1 (Overlap). Let X = {M^} and y = {Ny} he two POVMs. 
Then, we define the overlap of X and y as 

c{X,y) := max 

x,y 

This is in accordance with [KP02] , where an uncertainty relation (with- 
out side information) was first shown for von Neumann entropies. If the 
two measurements are projective, the expression for the overlap reduces to 

c = max |(x|y)| , 

x,y 

where the maximization is over all eigenvectors \x) of X and \y) of y. The 
name overlap is clearly motivated by this expression. 



""^This is true since we do not use the post-measurement state on the A system 
and, thus, the freedom to choose a phase in the decomposition of the POVM elements 
M = E^ E is meaningless. 
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7.2.2 Uncertainty Relation for Min- and Max-Entropies 

The following inequality (cf. [TRll]) bounds the sum of the min- and 
max-entropies of the post-measurement states in terms of the overlap. In 
the next sections, we will formulate a generalization of this statement to 
smooth entropies and mixed states. Here, we simply consider the special 
case (7.8), as its proof highlights the basic concepts and techniques used 
for the proof of the subsequent generalized uncertainty relation. 

Theorem 7.1 (UCR). Let p G 5<(^abc) be pure and X as well as y two 
POVMs on 'Jijx- Then, the post measurement states (7.6) and (7.7) satisfy 

H^^{X\B)p + H^^{Y\C)p > log ^^^ . (7.8) 

Proof. It will be helpful to describe the two measurements in the Stine- 
spring dilation picture (cf. Lemma 2.4) as isometrics followed by a par- 
tial trace. Let U be the isometry from A to A, X and X' given by 
U := Ylx V^x <S5 \x) (8" \x). The isometry stores two copies of the mea- 
surement outcome in the registers X and X' and the measured state in 
A. Here, {\x)} is an orthonormal basis of Jix — ^x'- Analogously, 
V := J2y \/^^^\y)'^\y)- Furthermore, we introduce the states Paxx'bc = 
U PabcV^ and Payy'bc = VPabc^^ , of which the post-measurement states 
appearing in Eq. (7.8), pxB and pyc, are marginals. 

The duality relation (cf. Lemma 4.1) applied to Payy'bc gives 

^max(^|C)p + ^min(^l^^'^)p = • (7.9) 

Comparing (7.9) with the statement of the theorem, it remains to show 
that H^^^^{Y\AY'B)p < H^-^{X\B)p - log ^ holds. More precisely, we will 
show that 

Hmm{Y\Y'AB)p = max sup{A € M : Payy'b < 2"^ly (g) aAY'B} 

a 

< sup{A eR: pxB< 2"^c tx (E> ag} (7.10) 

= H^,,{X\B)p-log-^. 

In order to arrive at (7.10), we thus need to show that, for any a £ 
S={'Kay'b), the following implication holds 

pAYY'B<2'^U(S>CrAY'B =^ Px B < ^'^C tx (^ (^B ■ (7.11) 

To show this, we apply the partial isometry W := UV"^ followed by a 
partial trace over X' and A on both sides of the inequality on the left-hand 
side. This implies 



2^PXB < tlx'A {W{1y «> aAY'B)W^) . (7.12) 
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Moreover, substituting the definition of W, we find that the trace term on 
the rhs. evaluates to 



vhs. = ^\x){x\ ^ {y\tv^{y/M^y%a^y'sy%VM^)\y) ■ (7.13) 
x,y 

Lemma A.l, in particular Eq. (A.l), now establishes that 

Combining this with (7.13) and (7.12) results in the inequality 

^^PxB < 2-^c Y^ \x){x\ {y\aBY' \y) = 2^^c Ix^cTb- 
x,y 

This establishes (7.11) and concludes the proof. D 

7.2.3 Modeling the Measurement Basis Explicitly 

An alternative formulation of the uncertainty principle requires an addi- 
tonal random variable, 0, which determines the choice of measurement on 
the A system. Consider, for example, the setup of the previous section, 
where the choice is between two incompatible measurements, X and 3^, 
which we assume both have the same number of different outcomes. In 
this case the random experiment of picking ■;? G = {0, 1} determines 
the binary choice of measurement. More specifically, say that t9 = leads 
to a measurement of X and t? = 1 leads to a measureinent of 3^. The 
measurement outcome, in either case, is stored in a classical register Z. 

If G is uniform and independent of the state p before measurement, we 
find 

H^^^{Z\BQ) + F^ax(^ICe) > log -^^ . (7.14) 

To see this, note that (7.8) implies 

2-^mi„(^|S) <c2^n.ax(>'|B) ^^^ 2-^min (^1^) < c 2^— ^^'^^ . 

Taking the convex sum with equal weight ^ of these two inequalities leads 
to (7.14), where we used the fact that the min- and max-entropies with 
classical side information can be expressed as averages (cf. Proposition 4.6). 
A generalization of this type of uncertainty relation to smooth entropies 
will be discussed below in Corollary 7.4. 
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7.3 The Generalized Uncertainty Relation 

7.3.1 Effective Overlap 

The overlap c{X,y) used in Theorem 7.1 is a function of the POVM el- 
ements of the two measurements under consideration and independent of 
the state prior to measurement. This is often desirable because this state 
might be unknown, or, in a cryptographic setting, prepared by an adver- 
sary. However, we will see that in some situations partial knowledge about 
the state before measurement can be used to improve the bound on the 
uncertainty. What follows is thus a generalized uncertainty relation of the 
form of Theorem 7.1 that introduces a trade-off between information about 
the marginal state before measurement and tightness of the uncertainty 
relation. Specifically, we consider the overlap /effective of a measurement 
setup, denoted c*, which describes the overlap of two measurements on a 
given marginal state. 

Definition 7.2 (Effective Overlap). Let p^ € 5<(!K_4) be a state and let 
X = {Mj,}, y = {Ny] he two POVMs on ^K^. Then, we call the triple 
{p_4, X, y} a measurement setup. The effective overlap of this measurement 
setup is defined as 

c*{pA,X,y) :=min| VtrfP^) max ||p^' V A^^A^iVy I 

K I ^-^ X 11 ^-^ -^ oo I 

"^ A: y ' 

where the minimum is taken over all projective measurements K, = {P^^ 
on !Ka that commute with both X and y.^ 

In the following sections, we will show that an UCR also holds for this 
definition of effective overlap. 

As a first example of the usefulness of such a generalized UCR, con- 
sider the scenario where we apply one of two projective measurements, 
either in the basis {|0), |1), |_L)} or {|+), |— ), |-L)} on a state p which has 
the property that '_L' is measured with probability at most t/.^ A direct 
application of the state-independent uncertainty relation (Theorem 7.1) to 
this setup will not lead to the desired results as the overlap of the two 
bases is trivially c = 1. Still, our intuitive understanding of this situation 
tells us that the uncertainty about the measurement outcome is high as 
long as r] is small. 

In fact, the effective overlap of this setup satisfies c* < {1 — rj)^ + rj. 
This formula can be interpreted as follows: with probability 1 — 77 we are in 
the subspace spanned by |0) and |1), where the overlap is ^j and with prob- 
ability rj we measure _L and have full overlap. To prove this upper bound, 



^The property that two measurements X and IC commute is equivalent to the con- 
dition M^P'' = P'^M^ for all x and k. 

^The diagonal states \±) are defined as |±) — (10) ± |1))/V2. 
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simply choose the projective measurement /C = {|0)(0| + |1)(1|, |-L)(-L|} in 
Definition 7.2. Hence, while Theorem 7.1 thus only provides a trivial bound 
for this example, an uncertainty relation in terms of the effective overlap 
would give the expected bound. 

We can always get a state-independent bound on the overlap by the 
choice fC = {1^}) that is 



c* < max \\S^ NyM^N^ 



Furthermore, note that for projective measurements X and 3^, the rhs. can 



be simplified to max. 



x,y 



{X 



agreement with the the usual overlap in 



Definiton 7.1. In general, it is conjectured that c* < c, i.e. that 



max 

X I 



y 



NyM^N^ 



< max 

oo x,y 



/NyM^jNy 



max 

x,y 



(7.15) 



Finally, if the measurements on the A system have binary outcomes, 
it is possible to upper bound the effective overlap (and, thus, lower bound 
the uncertainty) by the maximal CHSH Bell violation [CHSH69] that can 
be observed using this measurement setup on A with an arbitrary sec- 
ond party (i.e. on an arbitrary extension of the state and using arbitrary 
measurements by the other party.) This establishes an analytic relation 
between two fundamental concepts in quantum theory, Bell non-locality 
and uncertainty relations. We refer the interested reader to [HTll], where 
this relation is discussed in detail. 



7.3.2 The Generalized Uncertainty Relation 

We now consider a theorem that gives a very general formulation of the 
uncertainty principle for smooth entropies. It gives a lower bound on 
the uncertainty — in terms of smooth min- and max-entropies — about the 
outcome of two (incompatible) measurements, X and Z, conditioned on 
quantum side information and the result of and additional, projective mea- 
surement, K, that was done on the state before measuring X and Z. 

More formally, we consider an arbitrary tripartite quantum state, Pabc, 
two POVMs on A, X = {Mx} and y = {Ny}, as well as a projective 
measurement, /C = {Pk}- The post-measurement states when X and Y 
are measured after K, respectively, are thus 

pxKB = Y, I^X^I ® I^X^I ® ^'i^ac{V^xP^PabcP^Vm~x) and (7.16) 

x,k 

pYKc = Y\y){y\<< 

y,k 



iTAB{V^jP''PABcP''V^j)- 



(7.17) 
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The following theorem generalizes previously known uncertainty rela- 
tions for the smooth min- and max-entropies in a tripartite setting.^ The 
UCRs discussed in Section 7.4 are corollaries of this relation. 

Theorem 7.2 (Generalized UCR). Let e >0, e > and p G S<{'Kabc)- 
Moreover, let X = {M^} , y = {Ny] be POVMs on "Ka and /C = {P^} a 
projective measurement on "K^. Then, the post-measurement states (7.16) 
and (7.17) satisfy 

Hl^{X\KB), + Hl^^{Y\KC)p > log - and (7.18) 

cjc 

H'J+^X\KB), + H^„,,,{Y\KC), > log ^ - log | . (7.19) 

where the IC- overlap, Cjc, and the effective IC- overlap, c*^, are given as 

cr := max II \/M^P''\/iV^ 1 1 and 
k,x,y " V yiioo 

cl := Y^ tr(PV) max ^^P'^NyP'^M^P^NyP^ 
k y 

Proof We prove the statement for pure Pabc- Its generalization to mixed 
states then trivially follows from data processing inequalities for the smooth 
entropies (cf. Theorem 5.7). More specifically, assuming the result for pure 
states, we consider a purification Pabcd of Pabc-, for which the theorem 
holds with the substitution C — )• CD and then take the partial trace over 
D. As this cannot decrease the smooth max-entropy, the generalization 
follows. 

We consider the Stinespring dilation (cf. Lemma 2.4) of the joint mea- 
surement of X and /C, denoted U , which coherently stores the measurement 
outcome of X in registers X and X' and the measurement outcome of /C 
in K and K' . The isometry U : "Ka —5- ^axx'kk' is given by 

f7 := ^ \x)^ (g) \x)^, \k)^ (g) \k)^, (g, ^/M, P'' . 

x,k 

Similarly, we introduce the Stinespring dilation of the joint measurement 
of y and /C as 

We will also need the partial isometry W := UV\ which, using p^p^ = 
^kk'P^ , evaluates to 

W=Y^ 1^X2/1 ® \xM ® \k){k\ (g \k){k\ (g y/M^P''^. (7.20) 

x,y,k 



^The author is tempted — but will resist — to call it the mother uncertainty relation. 
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These isometries allow us to introduce the states Paxx'kk'bc = U PabcU^ 
and Payy'kk'bc = Vpabc^^ whose marginals are the post-measurement 
states pxKB and Pykc of (7.16) and (7.17), respectively. 

The proof now proceeds in several steps. First, we reformulate the 
statement of the theorem in terms of smooth min-entropies using the du- 
ality relation. Then, in order to show (7.18), we use use techniques sim- 
ilar to the ones discussed in Section 7.2.2 to extract the /C-overlap. To 
show (7.19), we first use Lemma A. 6 to find an upper bound on one of 
the entropies in terms of a relative entropy that is conditioned on p. The 
properties of p can then be used to extract the effective /C-overlap. 

Due to the duality between smooth min- and max-entropy (cf. Theo- 
rem 5.4), the first statement of the theorem is equivalent to 

H^^,^{X\KB), > H^^{Y\AY'K'B), + log - . (7.21) 

Let ^ be a state that maximizes the smooth min-entropy on the rhs. Then, 
there exists a a such that 

A = S,„UPayy'k'bUy cjay'k'b) = Hl,^{Y\AY'K'B)p . (7.22) 

The state p can be chosen classical on K' due to Proposition 5.8. 
Furthermore, note that the minimum distance purification of />, since its 
marginal is classical on K' , also inherits the coherence between K and K' 
from p. This follows from 

p(npn,p) = p(n/>n,n/>n) < p(p,p) , 

where H = ^^ \k){k\K ® \k){k\K' is the projector onto coherent superposi- 
tions of K and K' . We also used that ti jiilipU) = M.[p] = p, where M 
measures the K' system; thus, the projection does not change the marginal 
state. We also need another extension of p to X, which we define via the 
TP-CPM £ : /o ^ Efc \k){k\K' p\k){k\K' «> \k){k\K. This map measures K' 
and creates a (classical) copy of it in K. Since Payy'k'b is classical on K, 
it is easy to see that Tayy'kk'b = ^[Payy'k'b] is an extension of Payy'k'b- 
Furthermore, we can equivalently write Tayy'kk'b = ^{Payy'k'b ® Ik)!! 
and note that the two extensions agree on the diagonals, i.e. 

Vk : \kk\fAYY' KK' B\kk) = \kk\pAYY' kk' B\kk) . (7.2oj 

From the definition of SmmipAYY' k' bW'^y ® (^ay'k'b)-, we get 

Payy'k'b < 2"^ ly (Jay'k'b ■ (7.24) 

Taking the tensor product with Ij^ on both sides of (7.24) leads to 

PaYY' K' B tX" Ik ^ 2 "^YK ® <^AY'K'B ■ 
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We conjugate this inequality with W and take the partial trace over A, Y' 
and K' to get 

fxKB = tTAX'K' {W{PayY'K'B ® ^k)W^) 

< 2-^ tlAX'K {Wilyj, aAY'K'B)W^) . (7.25) 

We first evaluate the trace term on the rhs. of (7.25) using the expres- 
sion for W in (7.20) and Lemma A.l as in Eq. (A.l). We get 

tVAX'K' {W{tYK (^ (^AY'K'b)W^) 

= Yl l^)^^l ® l^)^^l ® {yH^rA{^/l^xP^^/^iCyAY'K'B^/NyP^^/Ml,)\yk) 

x,y,k 

y-^ , , , 2 

< Ix O 2^ \k){k\ (g) max ^jNyP^yJM^ {k\(yKB\k) 



x,y 
k 



tx ® OJk 



B 



The last equality defines the operator u. Note that tr(a;xs) < Cjc; hence, 
using Ukb = ^kb/c-k. S S<,{^kb) and (7.25), we find a lower bound on 
Srain{TxKB\\'^x ® ^kb) in terms of A and ck.. 

Srmn{fxKB\\tx ® Was) > A + log — = /7^i,,(y |.4y'K'5)p + log — , 

(7.26) 

where we substituted (7.22) for A. The state Txkb is sub-normalized. 
Furthermore, its purified distance from Pxkb is bounded by 

PyxKBj Pxkb) ^ P[PxKBj Pxkb) S £ j 

where we used that Txkb = Pxkb- This follows from the property that the 
two states agree on the diagonals of \kk) — in the sense of (7.23) — together 
with the expression 

Txkb = tv^x'K' {Wtayy'kk'bW^) = ^ \x){x\ \k){k\ (g) 



k,x,y,y' 



tiA[VMxP'"y%{yykk\fAYY'KK'B\y'y'kk)y^P 



^ ^x,, 



and the respective expression for Pxkb- Hence, using the definition of the 
smooth min-entropy, we get 

H^i^{X\KB)p > SminifxKBUx'SiUJKB) , 

which, substituted into (7.26), concludes the proof of (7.18). 

It remains to show the second statement of the theorem. Again, due 
to the duality between smooth min- and max-entropy this is equivalent to 

H^J+%X\KB), > Hl,^{Y\AY'K'B)p + log -^ - log | . (7.27) 
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Applying Lemma A. 6, we define the state p G B'^^'^'^ip) such that the 
following holds: 

SminipAYVK'sUY ® Pay'k'b) > H^^^{Y\AY'K'B)p - log ^ . 

We use the monotonicity of S'min under TP-CPMs to measure the K sys- 
tem, i.e. we apply the map Ai : t ^ ^^ \k){k\K' t \k){k\K' to both states 
in /Smin above. This will have no effect on Pay'k'b, which is classical on K' 
by definition. Using the state Payy'k'b = -MIpayy'k'b], we have 

A = Sn,in{pAYY'K'BUY ^ Pay'k'b) > H'^^{Y\AY' K' B) p " log ^ • (7.28) 
Moreover, the purified distance satisfies 

P(p,p) = P{M[p\M[p\) < P{P,P) < 2e + e. 
Prom the definition of SmmiPAYY' k" b\\^y "X" Pay'k'b), we get 

Payy'k'b < 2"^ ly <^ Pay'k'b , (7.29) 

where we employed the marginal state 

Pay'k'b = tVy^'iVpABV^) = "^^yP^ PabP^^v ®\k){k\ ® \y){y\. 

(7.30) 

The remainder follows the proof of the first statement closely. However, we 
will take advantage of the form of the marginal state in (7.30) to extract 
the effective /C-overlap. Taking the tensor product with 1^ on both sides 
of (7.29) leads to 

Payy'k'b iXi Ik ^ ^ ly^ Pay'k'b ■ 

We conjugate this inequality with W and take the partial trace over A, Y' 
and K' to get 

fxKB = tr AX' K'{W {payy'k'b <^ ^k)W'') 

< 2-HTAx'K{W{lyj, Pay'K'b)W^) . (7.31) 

We again evaluate the trace term on the rhs. of (7.31) using the ex- 
pressions for W and Pay'k'b in (7.20) and (7.30). We get 

tlAX'K' {W{1yK (» Pay'K'b)W'') 

= ^ \x){x\ (E) \k){k\ {yk\trA{VM^P''^/KJPAY'K'B^/KJP''VM^)\yk) 

x,y,k 

= Y^ \x){x\ (S) J2 \k){k\ ® tr^ [Yl VM,P%P''pABP''NyP''y^, 
X k y 

<tx(EYl l^)^^l ^ ™f^ \\Y,P''^yP^^xP''NyP^j ilAiP^PAB) 

k y 

(7.32) 

= tx® UJkb ■ 
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We used Lemma A.l to arrive at (7.32). The last equality defines the 
operator Cj. Note that tr(LJxfl) = c]^; hence, we choose ujkb = ^kb/c*iq G 
S={'KiiB) and (7.31) and find a lower bound on Suim{T'xKB\\'-^KB) in terms 
of A and c^. 

Sm^n{fxKB\\^KB) > A + log ^ > W^^^{Y\AY'K'B)p + log ^ - log % , 

(7.33) 
where we substituted (7.28) for A. We have 

P{txkb,Pxkb) = P{Pxkb,Pxkb) <2e + e, 
Hence, using the definition of the smooth min-entropy, we get 

H^i^{X\KB)p > Smm{fxKBUx <S) (Tkb) , 

which, substituted into (7.33), concludes the proof of the second statement 
and the theorem. D 

7.4 Miscellaneous Uncertainty Relations 

This section contains a collection of useful corollaries of Theorem 7.2, in- 
cluding the results discussed in the introduction of this chapter. 

7.4.1 Commuting Measurements 

A specialization of the generalized UCR which is of particular interest 
concerns the case when the two measurements X and y both commute 
with /C. In this case, the marginal states of (7.16) and (7.17) when K is 
traced out correspond to the post-measurement states when only X and 
3^ are measured, Eqs. (7.6) and (7.7). Formally, 

tr^KiPxKB) = ^ \x){x\ (g) tTAci'/M^PABcVM^) = PxB aud 

X 
tlKiPYKc) = ^ \y){y\ ^ tTAB{^/NyPABCy/Ny) = PyC ■ 

y 

They satisfy the following inequality. 

Corollary 7.3. Let e > 0, e' > 0, p G 5<(:K^flc) and X = {M^.}, 
3^ = {Ny} two POVMs on "K^. Then, the post-measurement states (7.6) 
and (7.7) satisfy 

H'Jn^%X\B)p + i/l.(y|C), > log ^.^p^^^^y^ - log I > 
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Proof. Let /C be the measurement — commuting with X and y — that min- 
imizes the effective overlap, c*{pA,X,y), in Definition 7.2. The corollary 
now follows from Theorem 7.2 applied to /C and the data processing in- 
equality of the smooth min- and max-entropy (cf. Theorem 5.7) applied 
for the partial trace over K. D 

7.4.2 Modeling the Measurement Basis Explicitly 

The full power of Theorem 7.2 comes to bear when we consider the follow- 
ing scenario. Let G be a classical register storing the choice of measurement 
that will be performed on the system A. For this purpose, we decompose 
Ji^, ^ :Ke (g) :K_4 and consider a family of POVMs {Z'^} on Ji^ that share 
the same output alphabet. For all ??, let Z = {-Lf } be the POVM on "Ka 
that is performed if iD is measured on 0. This process can be equivalently 
modeled as a POVM Z on 'Kq^, i.e. 



Z :-- 



{Y^mm^Lt}^. (7.34) 



If Pqabc = XlijI^X'^^le ® P^ABc is ^'^ arbitrary state classical on ©, the 
post-measurement state is 

PezBc = Y. I^X^I ® I^X^I ® trA{^/Lt p'^ABc^) ■ (7-35) 

This leads to the following result, which, in contrast to the uncertainty 
relations discussed above, relates the smooth min- and max-entropies of 
the same post-measurement state. 

Corollary 7.4. Let e > and peABc = Ei? I^X^Ie ^ Pabc ^ S<{:KeABc) 
be classical on and let f : Q ^ Q be a bijective function such that 
Pabc ~ Pabc- Then, the post-measurement state (7.35) after measuring 
the POVM (7.34) satisfies 

H^,,{Z\eB)p + H^,,iZ\eC)p > log 1 , 

where cj = max^ c{Z'^, Zf^"^'). 

Proof. We consider the POVMs X = Z a.ndy = {Y,^ \^){'d\ U^^^] ^ as 
well as the projective measurement /C = {I^X"*^!}' k = •&. It is easy to verify 
that fC commutes with X and y and that the /C-overlap of Theorem 7.2 
evaluates to c/c = Cf. Thus, 

H'^,,iX\eB), + H^,,,{Y\eC)p > log - . 
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Now, the corollary follows from the observation that the post-measurement 
states are equivalent up to local isometrics, pexs = Pezs and Perc = 
F^PqzcF, where F = J^^ |/(^))('*9| and we used the natural isometries 
\i) I—)- \i) between the Hilbert spaces "Kz — "Kx — 'Jiy Clearly, the smooth 
entropies are invariant under local isometries (cf. Proposition 5.3). D 

Note that a similar result can be derived based on the effective overlap 
formulation of the uncertainty relation, resulting in an overlap of c*r = 

E^tr(/)c(Z^Z/W). 

Finally, the uncertainty relation (7.14) of Section 7.2.3 is a special case 
of this corollary where Pbabc = \'^e® Pabc and / is the bit flip. 

7.4.3 The von Neumann Limit 

Using the asymptotic equipartition property, we directly get an uncertainty 
relation for the von Neumann entropy as well. 

Corollary 7.5. Let p e S={Kabc) and X = {M^}, y = {Ny} two POVMs 
on "Ka and K, = {P^} a projective measurement on "K^- Then, the post- 
measurement states (7.16) and (7.17) satisfy 

H{X\KB)p + H{Y\KC)p > log ^ , 

where cx, is defined in Theorem 7.2. Furthermore, it holds that 

1 



Proof. We prove the first statement, from which the second follows by the 
same considerations that led to Corollary 7.3. 



Consider an n-fold tensor product Hilbert space ^a"b'^c" — ^ 



ABC 

for an arbitrary n E N. We consider i.i.d. product states of Pabc on this 
space, i.e. the states Pa^b'^c^ = Pabc- Furthermore, we define i.i.d. product 
measurements A'" = X^"- := {(^^ Mxi}x", where x" = (xi, X2, . . . , x„) is a 
string of n measurement outcomes. Similarly, 3^" = 3^®*^ and /C" = /C®". 
Clearly, the post-measurement states of the n-fold measurement setup also 
have i.i.d. product form, i.e. Px^k^b" = PxliB and Py^k'^c" = Pykc- 
The effective overlap of the n-fold setup can now be calculated as 



k" i y" i 

= V n tr(P^-V) max II'Vp''' Ny^P'^^M^^P'^^Ny^P''^ 

k" i Vi 

= n E tr(^'V) max \\Y,P''''Ny^P'''Mx^P''^Ny^P''^ 

^i II 

i ki Vi 

= {clcT (7.36) 
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The uncertainty relation for smootii min- and max-entropies, Theo- 
rem 7.2, now states that 

H^J^^'{X^\K^B-), + i/l,(Z"|/^"i?")p > log 

for any s,e' > such that e + 2e' < 1. We divide this inequality by n and 
use expression (7.36) for the effective overlap above to get 



1 


, 2 




- log ^ . 


CK" 



-^^n''(^"l^"^")p + -HL^{Z^\K^B-), > log ^ - i log ^ . 

Taking the limit n — )• oo using the asymptotic equipartition property 
(Corollaries 6.6 and 6.7), we find the uncertainty relation for von Neu- 
mann entropies. D 

7.4.4 The Quantum Key Distribution Setup 

A specific example of the setup of Section 7.4.2 is very relevant in the 
application of the uncertainty relation to quantum key distribution. (See, 
for example, Section 8.3.) 

We use 0" G {0, 1}" as a basis choice for a measurement on an n-partite 
system A"'. The measurement we consider is a product measurement, 
where, on the i-th part of the system, the measurement Z'^^ = {Lf*}^ is 
executed depending on the corresponding bit "di in t?". The full POVM 
corresponding to this measurement process is 



n 
\ ^ l„Q™\/„an 

1?" i=l 



n 

\^'"W\(^®I^%} ^. (7.37) 



The result is an n-bit string z", which is stored in a classical register Z". 
Hence, the post-measurement state — when G" is uniform and independent 
of /O — is given as 

Ps-z-Bc = Y. ^I^"X^"I » tr^" ((8) 4^1Pa-bc (g) VS^) . (7.38) 

Corollary 7.6. Let e > 0, e > and p £ 5<(:Kf" ® JiBE)- More- 
over, let Z^ and Z^ be two POVMs on "K^- Then, the post-measurement 
state (7.38) that is produced by measuring Z in (7.37) in a uniform and 
independent basis, G", sastisfies 

F^i„(Z"|G"i?), + i/U(^"|0"C'), > nlog ^^J^2^y 
Furthermore, if the marginal Pa^ = Pa is of i.d.d. product form, we have 

H'j^%z-\e^B), + H^^^iz-\e-c), > nlog ^.(^pjz^^z^) - ^°^ I- 
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Proof. To prove the first statement, we start with Corollary 7.4 and the 
measurements described above. The function / is in this case the bit-flip 
on the whole string and the condition p^ngf^ = Pa^Ic i^ trivially satisfied 
since is independent of p by assumption. Finally, it is easy to verify that 
the overlap c/ evaluates to c{Z^ ., Z'^)'^ . 

The second statement requires an additional, projective measurement 
K, on "Ka that commutes with both Z^ and Z^ . Let K, be the measurement 
that minimizes the effective overlap c*{pa,Z^,Z^) in Definition 7.2. We 
measure fC on all n subsystems together with 0" resulting in an additional 
classical register K"^. Using Theorem 7.2, this leads to the relation 

1 , 2 

c*^{Pa,Z»,Z^) e^ 

where we used the arguments in the proof of Corollary 7.5 to simplify the 
expression for the effective overlap. The statement then follows after a 
partial trace over K"^ on both entropies. D 

Note that the above results can be extended to the case where measure- 
ments and the marginal state are of general product but not i.i.d. form. In 
this case, the logarithm of the overlap on the rhs. is replaced by an average 
over the logarithmic overlap on all subsystems. 

7.5 Bipartite Uncertainty Relations 

We have argued that the natural generalizations of uncertainty relation 
without side information to uncertainty relations with quantum side infor- 
mation introduces two distinct observers. Here, applying the chain rules 
for the smooth entropies introduced in Section 5.6, we show a bipartite un- 
certainty relation. Note that such an uncertainty relation necessarily needs 
to have a term on the rhs. that characterizes the entanglement between the 
observer and the system prior to measurement. If entanglement is present, 
the bound has to be reduced accordingly. In particular, if two projective 
measurements are executed on a system that is fully entangled with the 
observers, there is no uncertainty on the measurement outcome.^ Hence, 
we see a trade-off between entanglement and uncertainty in bipartite un- 
certainty relations — the more entanglement there is prior to measurement, 
the less uncertainty is created. 

Here, we show that this picture is still not complete. In order to get 
a tight result for arbitrary projective measurements and POVMs, we also 
need to consider the entanglement that is left after measurement. This is 



^Sce, for example [Bcr08], where this is discussed for an uncertainty relation in the 
von Neumann limit and for rank-1 projective measurements. 
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due to the fact that general measurements (in contrast to rank-1 projective 
measurements) do not necessarily destroy all entanglement between the 
two parties. 

This result might have applications in two party quantum cryptogra- 
phy, where uncertainty relations are used to bound the knowledge of one 
party about the measurement results of the other party. An example of 
such an applications of bipartite UCRs can be found in [BFWll]. 

We consider a bipartite system shared between A and B, where, as 
before, the system A is measured using one of two measurements X or y. 
The entanglement before measurement is characterized by the smooth min- 
entropy of A given B while the entanglement after measurement is given 
by the smooth max-entropy of the system after measurement, denoted A' 
given B and the measurement outcome. More precisely, we start with an 
arbitrary state Pab and consider post-measurement states 

Pa'xb = y^ \x){x\ (g) ^/M^PabVm^ and (7.39) 



PA'YB=Y,\y)(y\'^V^PAB^/N^- (7.40) 

y 

Note that, in contrast to the entropic uncertainty relations discussed 
earlier, the system A' will appear explicitly in our statements. Since the 
state of this system is not unique for a given POVM, we simply chose the 
simplest measurement TP-CPM consistent with the POVM, which leads 
to Eqs. (7.39) and (7.40). This implies the following uncertainty relation.^ 

Theorem 7.7. Let e > 0, e > 0, i > 0, e' > and Pab G '5<(IK^b). 
Moreover, let X andy be two POVMs o'ti'Ka- Then, the post-measurement 
states (7.39) and (7.40) satisfy 

Hi^{X\B), + H^,,{Y\B), > H'^^iA\B), - Hi,^{A'\YB)p 

1 2 

+ log— -41og^, 

where e = 7i?-F6e-F4e' + 8e and c* = c*{pA,X,y). 
Proof. We start from Corollary 7.3, which implies 



1 . 2 



HLin{X\B), > H^^+'^ +^^{Y\^Y'B)p + log - - log _, . 

Now, we apply the chain rule (5.13) to the min-entropy on the rhs. with 
the substitutions e -^ e, e' -^ e and e" ■'^ e + e' + 2e as well as the systems 
A^Y,B ^ A'Y' and C ^ B. This leads to 

Hi,^{X\B)p > Hi,,,{A'YY'\B)p - H^^tI'-''{A'Y\B)p + log 1 - 31og | , 



^Other uncertainty relations of the types discussed above can be made bipartite too. 
Here, we are interested in a simple example. 
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where we used that Y and Y' are interchangeable. Then, we apply the 
chain rule (5.12) to the max-entropy with the substitutions e -^ e, e' ^ e' 
and e" •(— e as well as the systems A -^^ Y' , B ^ A' and C ■^ B. This 
directly leads to the statement of the theorem when we note that 

Hi,^{^YY'\B)p = Hi,^{A\B)p (7.41) 

due to the covariance of the smooth entropies under isometries (cf. Propo- 
sition 5.3). n 

We are also interested in the von Neumann limit of this uncertainty 
relation. Using the AEP and the same techniques as in the proof of Corol- 
lary 7.5, we find 

H{X\B)p + H{Y\B)p > log ^ + H{A\B)p - H{A'\YB)p . 

Due to the symmetry of the Ihs. of this expression, we can replace the Y 
on the rhs. by an X; thus, the lower bound is effectively a function of the 
minimum of these two entropies. In the following, we prefer to model the 
basis explicitly and use that H{Z\eB) = \H{X\B) + \H{Y\B) if the 
basis choice is uniform. We get^ 

1,1 1 

- log — I — 

2 ^c 2 

Note also that if the measurement is projective and rank-1, the system 
A' will simply contain a copy of Z and the second entropy thus vanishes. 
This leads to the uncertainty relation of [BCC^IO], 

H{Z\QB)p>]^{\og-^+ H{A\B)p). 



H{Z\@B)p > log - + - [H{A\B)p - H{A'\ZQB), 



7.5.1 Chained Uncertainty Relations 

Consider two consecutive applications of the uncertainty relation — first 
measure Zi of a system Ai in the basis 0i and then Z2 on A2 in the basis 
02. We can now derive two different bounds on the total entropy produced 
by these operations. First, we consider the two measurements together to 
get 

H{Z\QB)p > log ^ + ^ {h{A\B)p - H{A\ZQB)p) , (7.42) 

where = Bi02, Z = Z1Z2, A = A1A2 and A' = A'^A'2 contains the 
systems Ai and A2 after measurement. On the other hand, we may write 
down the uncertainty of the first measurement separately. 

H{Zi\eiB)p > i log ^ + ^ (^H{A\B)p - H{A[A2\ZieiB)p 



^For the following arguments, we replaced the effective overlap by the overlap. 
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Now, the second system is measured. Namely, we consider a measurement 
of Z2 in the basis 02 on the second part of the joint system Al^A^- The 
observers holds the information gained in the first measurement, i.e. 0i 
and Z\ in addition to B. 

H{Z2\Z^eB)p > ^ log ^ + ^(^H{A[A2\ZieiB)p - H{A'\ZeB)^ 

Adding the previous two inequalities leads to 

H{Zi\eiB)p + H{Z2\Zi@B)p > log - + 1(h{A\B)p - H{A'\ZeB), 



(7.43) 

The two bounds, Eqs. (7.42) and (7.43), are equivalent since Zi is inde- 
pendent of 62 and, thus, H{Zi\@iB) = H{Zi\QB). 

This shows that we can split the process of producing uncertainty into 
two individual steps without loosening the bound on the uncertainty. In 
this sense, the bipartite uncertainty relation can be considered tight. 
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Chapter 8 

Applications 



This chapter discusses three example apphcations of the smooth entropy 
framework and uses results from Chapter 3-7. 

In Section 8.1, we consider source compression, which has been used 
as an example in the introductory remarks, and show how the characteri- 
zation of this task in the one-shot setting allows us to retrieve direct and 
converse bounds on coding for finite block-lengths as well as strong con- 
verse statements. In Section 8.2, we consider randomness extraction, a 
task that is very important in cryptography and is naturally considered in 
the one-shot setting. Then, in Section 8.3, we combine the above results 
and show how, in conjunction with an uncertainty relation, they allow us 
to prove security of a quantum key distribution protocol. 

8.1 Full Characterization of Source Compression 

This section provides an example of how the one-shot characterization of 
an information theoretic tasks is sufficient to derive bounds on the resource 
usage for finite block lengths and in the i.i.d. limit. 

We consider source compression with quantum side information, or, 
equivalently, information reconciliation. In particular, we give a strong 
converse to information reconciliation. We employ many results of this 
thesis, including properties of the purified distance from Chapter 3, the 
data processing inequalities of Chapter 4 and the asymptotic equipartition 
property of Chapter 6. 

The bounds derived here clearly also hold in case the side informa- 
tion is classical or non-existent. Thus, we provide bounds for classical 
source compression tasks as well. From this viewpoint, the following sec- 
tions prove Shannon's source coding theorem [Sha48], as well as its exten- 
sion to side information (Slepian-Wolf [SW73]) and quantum side infor- 
mation [DW03].^ We also prove a strong converse of source compression 



^The Slepian-Wolf setting with two separate encoders can be viewed as source com- 
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with side information. (See also [Ooh94], where a the error exponent of 
the Slepian-Wolf strong converse is investigated.) Related to this work, a 
strong converse for compression of quantum information [Sch95] was shown 
by Winter [Win99]. 

8.1.1 One-Shot Characterization 

The one-shot results are extracted from a recent paper by Renes and Ren- 
ner [RRIO]. Given a classical-quantum state Pzb^ shared between two 
parties, Alice and Bob, how much information needs to be transmitted 
from Alice to Bob such that Bob can reconstruct Z with probability of 
error at most el To investigate this question, we consider non-interactive 
(one-way) information reconciliation protocols from Alice to Bob. These 
consist of an encoding function, e : Z — )• A^, that creates a message M to 
be sent from Alice to Bob. Bob then uses a decoder, a POVM V = {D^'} 
acting on the joint state of B and M that produces a classical estimate 
of Z in the register Z' {'Jiz — ^z')- A protocol is thus characterized by 
the tuple V = {e,T>}. Note that this is the most general model for a 
non-interactive information reconciliation protocol. In particular, since we 
consider a fixed state pzB , it is sufficient to consider deterministic encoding 
strategies 

The initial state pzB is of the form 



PzB 



TPz{z)\z){z\z^pI, (8.1) 






where Pz{z) is the distribution over the input alphabet Z and /?| S S={^b) 
are quantum states on B. Applying a protocol {e,2?} as described above 
to this state results in the final state 



Pzz' 



Y,Pz{z)tT [d,,{p% \e{z)){e{z)\M))\z){z\z \z'){z'\z', 



or, equivalently, the joint probability distribution Pzz'iz,z') = Pz{z) 
tr (^Dzi^pg (g) \e{z)){e{z)\^Y The error probability of this protocol is 



Perr(V,pzB) := 1 " ^^ ^(^) ^^ [Dz{pl ® \e{z)){e{z)\ 

z 

The error probability is thus equivalent to the trace distance between pzz' 
and the state Xzz' = Ylz Piz)\z){z\z'^\z){z\z' describing perfect correlation 
between Z and Z' . 

We characterize information reconciliation of a state pzB from Z to B 
with the minimum message length (in bits), m^, required to achieve an 



pression followed by source compression with side information. 
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error probability of at most e. 

m'^{Z\B)p := min{?7i : 3V s.t. logdM = m and Perri'P, Pzb) < £}• 

We now slightly extend a result from [RRIO]. The converse bound 
found there is not strong enough for our purposes when e gets close to 1. 
However, this can be fixed easily. 

Theorem 8.1. Let pzB S S<{'Kzb) be classical on Z and let < e < 1. 
Then, 



H^^{Z\B), < m%Z\B), < H^,,{Z\B)p + 2 log - + 4, 

for any Ex^Ei s.t. ei + £2 = £■ 

Proof. The direct bound, nf{Z\B)f, < H^^^{Z\B) p + 2log ^ + 4, is shown 
in [RRIO]. To get a converse bound, we also follow their argument and 
note that Perri'P, Pzb) < e is equivalent to the condition D{pzz', Xzz') < e. 
This implies P{pzz',Xzz') < \/2e — e^ = e' according to Proposition 3.3. 
Hence, 

HL.iZ\Z')p<H^.AZ\Z\ = (8.2) 

by definition of the smooth max-entropy. The data-processing inequality 
(Theorem 5.7) then states that H^^^{Z\MB)p < before measurement. 
And finally we use the fact that conditioning on classical information 
M can at most reduce the smooth max-entropy by \og dj^i (cf. Proposi- 
tion 5.10). This leads to the following inequality: 

> Hi,^iZ\MB)p > Hi,^{Z\B)p - \ogd,,. 

Hence, all protocols with Perri'P-, Pzb) ^ ^ have to satisfy the constraint 
log c^M > H^^-^iZ\B) p. This results in the improved converse bound of the 
theorem. D 

8.1.2 Finite Block Lengths and Asymptotic Limits 

The one-shot result in principle characterizes all data-reconciliation tasks. 
Comparing the one-shot analysis with asymptotic results, where ttt,^ con- 
verges to the von Neumann entropy [DW03], we might be interested to 
know how fast the one-shot result in terms of the smooth max-entropy 
convergences to the von Neumann entropy. 

For this purpose, we consider the task of encoding a block of length 
n of i.i.d. random variables Z about each of which Bob independently 
has quantum side information B. This is data reconciliation for the state 
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Pz^gn = p®g and Theorem 8.1 applies. The minimum message length (per 
round) for this task, frf{Z\B) = -rrf{Z^\B'^), thus satisfies 

Ih^^{Z-\B-), < m%Z\B), < l(H^Jl{Z-\B-), + 2log-+6), 

(8.3) 

where we chose ei = £2 = | for convenience of exposition. Next, we bound 
the smooth max-entropies using the asymptotic equipartition property in 
its entropic form. 

Direct Bounds 

We start with the upper bound in (8.3). For sufficiently large n, the AEP 
states that (cf. Theorem 6.4 and Corollary 6.5) 

41ogt;Vg(§) 21og- + 6 
fh'{Z\B),<H{Z\B)p+ ^ V^l2^ ^ ^' , (8.4) 

vn n 



where 5(e) = - log (l-Vl-e^) and v = V2--f^min(^l^)p+V2^max(^|s)p + i. 
Hence, in the asumptotic limit of large n, we have 

lim m'iZlB)^ < H{Z\B)p. 

n—>-oo 

Namely, there exists an encoding strategy that transmits at most H{Z\B)p 
bits per round to Bob with an arbitrarily small positive probability of error 
(e > 0). The converse question is whether we can do better than this. 

Converse Bounds 

To see that this is not possible, even for large error probabilities, we ap- 
ply the AEP to the lower bound in (8.3). In the following, we use e to 
denote the success probability, i.e. 1 — e is the error probability. Using 
Proposition 5.5 and Remark 5.6, we find 



>ifF^iJZ"|i?"),-2fo 



1 - (e e + ^/r^^VT^i^ y 



Here, the smoothing parameter of the min-entropy, e, is restricted by the 
inequality arcsin(e) + arccos(e) < |. This holds if and only if e < e and, 
for convenience of exposition, we choose e = |. This ensures that the 
logarithmic correction term is finite for any < e < 1. For the fun of 
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introducing a symmetry with Eq. (8.4) where there is none (at least not in 
the form of the exphcit deviation terms), we further bound this as fohows.^ 

^'''{Z\B). > I U^^^inS^P - 2 log i - eV 

Using the AEP, this time for the min-entropy, leads to 

rn^'%Z\B), > H{Z\B), - '^"^"/^ - ^J^^^\±l. (8.5) 

Vn n 



Hence, in the asymptotic limit, for all < e < 1, we find 
lim m^-'' (Z\B) . > H(Z\B)p. 

71— >CXD 

This is the converse of data-reconciliation. It states that, even for an 
arbitrarily small success probability (e > 0), there does not exist a protocol 
that allows us to decode using less than H{Z\B)p transmitted bits per 
round. 



8.1.3 Strong Converse 

Finally, the non-asymptotic converse bound, Eq. (8.5) gives us the means 
to make this statement more precise. For example, we can ask how the 
maximal success probability, ffmax; scales if we use a protocol that encodes 
only H{Z\B)p — jj, bits per round for large n. For this analysis, we neglect 
the term scaling reciprocally in re, i.e. we consider the approximate bound 

1 «■. , X ^ , , . 41ogt;^/5(|) 



m'--%Z\B),>H{Z\B) 



n 



This implies that 



emax(Ai) ^ sup <^ e > : ^ > fi 



. 4:logVy/g{ 



n 



< \/2-2 2(4 log. )^. (8.6) 

In order to find (8.6), we used the bound g{£/2) < log \ (cf. Figure 6.2). 
This establishes that the maximal success probability drops exponentially 
in re and ^^ if we transmit less than the Shannon limit, H{Z\B)p bits per 
round. The exponential drop of the success probability in re is called the 
strong converse. 



^ We hope the reader does not mind us skipping the details of this simple procedure, 
which we have tested numerically and analytically. 
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8.2 Randomness Extraction 

This section discusses randomness extraction, the art of extracting uni- 
form randomness from a biased source. More precisely, we consider a 
source that outputs a register Z about which there exists side informa- 
tion E — potentially quantum — and ask how much uniform randomness 
5", independent of E, can be extracted from Z. This primitive is of crucial 
importance in many cryptographic tasks, for example in quantum cryp- 
tography. There, we are interested to distill a secret key from a raw key 
that is partially correlated with a quantum eavesdropper. 

The quality of the extracted randomness is measured using the trace 
distance; we consider the distance 

A(5|S)p := minD(/9sE,vrs (gXTe), where tts = -— l^ 
o-B ds 

and the optimization is over all Ge S S={'Ke)- Due to the operational inter- 
pretation of the trace distance as a distinguishing advantage (3.1), a small 
A implies that the extracted random variable cannot be distinguished from 
a uniform and independent random variable with probability more than 
2(1-1- A). This viewpoint is at the root of the universally composable secu- 
rity framework [CanOl, PWOl] in cryptography, which was extended to the 
quantum setting in [UnrlO] based on earlier work, for example in [Ren05]. 
We allow probabilistic protocols to extract this uniform randomness. 
These can be modeled by introducing an additional independent random 
seed, stored in a register F and then applying a (deterministic) function 
/ G F on X to get S. Namely, we consider a joint state Pzef of the form 

PZEF = 5]Pz(^) \Z){Z\, ® pl ^P^(/) l/X/U, P% G S<{-Ke). 

f 

and require that ^ Pf(/) A(5|£')p/ is small, where psE is the state pro- 
duced when / is applied to the Z register of pzE-^ A randomness extraction 
protocol, V, thus consists of a probability distribution Ppif) on a register 
F and a family of functions f & F from Z to S. 

The maximal number of bits of uniform randomness, i^, that can be 
extracted from a state pzE is defined as 

e{Z\E)p := max {£: 3V s.t. log(is = ^and ^Ppif) A{S\E)pf < e}. 
A protocol that satisfies Yl Ppif) ^{S\E) f < e is called e-good. 



^This is satisfied if if A.{S\EF)p is small, i.e. if the extracted randomness is inde- 
pendent of the seed F as well as E [TSSRll]. This is also called the strong extractor 
regime in classical cryptography. 
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8.2.1 Direct Bound 

A particular class of protocols that can be used to extract uniform random- 
ness are based on two-universal hashing (see Carter and Wegmann [CW79]). 
The classical Leftover Hash Lemma [Mcl87, ILL89, IZ89] states the amount 
of extractable randomness is at least the min-entropy of Z given E. This 
construction was then extended to the quantum setting in [KMR05, RK05, 
Ren05]. In fact, since hashing is an entirely classical process, one may ex- 
pect that the physical nature of the side information is irrelevant and that 
a purely classical treatment is sufficient. This is, however, not necessarily 
the case. For example, the output of certain extractor functions may be 
partially known if side information about their input is stored in a quantum 
device of a certain size, while the same output is almost uniform condi- 
tioned on any side information stored in a classical system of the same 
size.^ For protocols based on two- universal hashing, the following direct 
bound holds [TSSRll]. 



i'iZ\E),> HXiZ\E) p-2\og- + l, where £ = 81+82- (8.7) 

£2 

Other protocol families that extract the min-entropy against quantum ad- 
versaries^ are based on almost two- universal hashing [TSSRll] or Trevisan 
extractors [DPVR09]. 

Note that the protocol families discussed above work on any state pzE 
with sufficiently high min-entropy, i.e. they do not take into account other 
properties of the state. Next, we will see that these protocols are essentially 
optimal. 

8.2.2 Converse Bound 

We prove a converse bound by contradiction. This forixializes an intuitive 
argument given in [TSSRll]. Assume for the sake of the argument that we 
have an e-good protocol that extracts £ > H^^^{Z\E)p bits of randomness, 
where e' = \28 — 8^. Then, due to Proposition 5.11 we know that applying 
a function on Z cannot increase the smooth min-entropy, thus 

V / G F : F^iJ5|F)^/ < I and, thus, A(5|F)^/ > 8. (8.8) 

The second statement of (8.8) follows from the following observation. The 
inequality li'^^J^S\E') pS < £ implies that all states p with D{psE, Pse) < £ 
(cf. Proposition 3.3) must necessarily satisfy Hj^^^{S\E)p < i. In particu- 
lar, these close states can thus not be of the form tt^ cr^, because such 
states have min-entropy i. Hence, A{S\E)pf > e. 



^See [GKK+07] for a concrete example and [KRll] for a more general discussion). 
^ These families are considered mainly because they need a smaller seed or can be 
implemented more efficiently than two-universal hashing. 
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However, (8.8) implies that Yl Ppif) ^{S\E)pf > e, contradicting our 
initial assumption that the protocol was e-good. This implies the following 
converse bound 



e{Z\E), < H^^\Z\E),. (8.9) 

Collecting (8.7) and (8.9), we have the following theorem 

Theorem 8.2. Let pzE S S<{'Kze) be classical on Z and let {) < e < 1. 
Then, 

H'A.{Z\E)p - 2 log 1 + 1 < e{Z\E), < H^^{Z\E)„ 

for any £1,62 s.t. ffi + £2 = £■ 

We have established that the extractable uniform and independent ran- 
domness is characterized by the smooth min-entropy. A similar analysis 
for finite block lengths and the i.i.d. limit as in Section 8.1 is thus possi- 
ble. However, we omit it here since most applications consider the task of 
randomness extraction only in the one-shot setting. An example of such 
an application, quantum key distribution, follows in the next section. 

8.3 Security in Quantum Key Distribution 

The smooth entropy formalism was first introduced in the quantum setting 
by Renner [Rcn05] in order to prove information theoretic security of quan- 
tum key distribution (QKD) protocols [BB84, Eke91] against adversaries 
restricted only by the laws of physics. Prior to this work, the security of 
QKD protocols was mostly analyzed in the limit of long keys and using 
questionable security definitions [KRBM07]. 

In this section, we show that the uncertainty relations for smooth en- 
tropies in Chapter 7 can be employed to provide a very concise and intu- 
itive security proof for QKD. This is based on two recent papers [TRll] 
and [TLGRll]. The latter paper contains a thorough analysis of the sta- 
tistical tests required to assert security of the final key that goes beyond 
the discussion here. 

8.3.1 The Protocol 

We consider the original BB84 quantum key distribution protocol [BB84]. 
Here, two legitimate parties, Alice and Bob, try to distill a shared secret 
key that is independent of all wiretapped information. To do this, Alice and 
Bob share a public quantum channel and a public, authenticated classical 
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channel. In the entanglement-based version of the BB84 protocol [BBM92] , 
Alice now prepares n fully entangled qubit pairs of the form 

IV') = ^(|o)a. ® |0U + |i)a, ® |1)b.). (8-10) 

where i G {1, . . . , n}. She then sends the systems B" = Bi . . . Bn over the 
public quantum channel to Bob. 

Next, Alice chooses a basis for each qubit, Gj G {0,1}, uniformly at 
random. She communicates the basis choices, 0" = ©i...0„, to Bob 
over the classical channel and then measures her qubits Ai in the basis 
X = {|0), |1)} if Gi = or 3^ = {|+), I-)} if Gi = 1.^ The collected mea- 
surement outcomes, called raw key, are stored in a string Z" = Zi . . . Z„. 
Bob does a measurement on each of his systems depending on the value of 
Gj to produce a raw key Z". This already concludes the quantum part of 
the protocol. 

Next, Alice and Bob employ a one-way data reconciliation protocol 
that allows Bob to create an estimate Z" of Z" using his raw key Z". 
Finally, Alice invokes a randomness extraction protocol on her string Z 
to distill a key S and broadcasts the seed, F, over the classical channel in 
order to allow Bob to extract an estimate S of S. The latter step is usually 
called privacy amphfication [BBR88, BBCM95]. 

We note that, if the eavesdropper does not interfere and Bob simply 
measures his part of the entangled pair using the same bases as Alice, the 
resulting raw keys, Z*^ and Z", will be perfectly correlated. In this case, 
security of the key simply follows from the monogamy of entanglement 
(i.e. Bob is the preferred observer of Alice's system) and no information 
reconciliation is necessary. In the next section, we analyze this protocol 
in the presence of an eavesdropper and noise. (The effects of noise and 
wiretapping can in general not be distinguished; hence, we consequently 
assume that correlations are degraded due to wiretapping on the quantum 
channel.) 

8.3.2 Security in the Finite Key Regime 

Under what conditions will the key extracted by the above protocol be 
both secret (i.e. uniform and independent of the eavesdropper's informa- 
tion) and correct (i.e. S = S)l We will see that quantum mechanics allows 
us to ascertain both secrecy and correctness from the correlations between 
Alice's measurement outcomes and Bob's side information about them. 
This is in contrast to classical theory, where, without further assump- 
tions, we could only hope to determine correctness from such correlations. 
Quantum mechanics enables this due to the asymmetry between different 



^The diagonal basis is given by 1±) = 1/^2(10) ± 11)). 
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quantum mechanical observers, as discussed in the introduction of this the- 
sis. If it can be estabhshed that one observer, Bob, is a preferred observer 
of a quantum system Alice holds, then this ensures that an eavesdropper's 
knowledge about the outcomes of measurements performed on that system 
is limited. Here, we show how this asymmetry can be verified using the 
entropic uncertainty relation of Chapter 7. 

Since the eavesdropper is allowed to interact arbitrarily with the quan- 
tum communication sent from Alice to Bob, we do not make assumptions 
about the form of the resulting state Pa"b^e- In particular, the systems Bi 
that arrive at Bob's lab do not need to be qubits and E is an arbitrarily 
large system, held by the eavesdropper, that may be correlated with the 
systems A^ = Ai . . . An and B^. 

The uncertainty relation for smooth min- and max-entropies, in par- 
ticular Corollary 7.6, can be applied to this setup. It states that 

H^^,^{Z^\@^E) + H^^,^{Z^\@^B) > nlog^, (8.11) 

where c is the overlap of Alice's measurements (one one qubit). In the case 
of entanglement-based BB84 as described above, we have c = ^ and, thus, 
the left-hand side of (8.11) evaluates to n. This means that the smooth 
min-entropy of the eavesdropper's information about Z^ right after Alice's 
measurements is lower bounded by n minus the max-entropy of Bob's 
correlations with Z"". After Bob's measurement, using the data-processing 
inequality (cf. Theorem 5.7), we thus have 

//^i,(Z"|e"B)>n-FU(^"l^"). 

During information reconciliation, using a protocol as discussed in the 
direct part of Theorem 5.7, Alice sends a message M to Bob which satisfies 

logdM < F^/,2(^n|^n) ^21og- +6, 

where Cc is the required correctness. This ensures that the probability that 
Bob cannot correctly estimate Alice's string is at most Cc- Note that this 
message might be wiretapped by the eavesdropper and stored in a register 
M' . However, using Proposition 5.10, we bound 

Hf,,,{Z^\e^EM) > W^^{Z^\Q^E)-\ogd,, 

>n- H^^^^iZ^m - H^l^iZlZ-) - 21og - - 6. (8.12) 



£. 



c 



The direct part of Theorem 8.2 now states that we can extract at least i 
bits of e^-secure key, where 



res/2 



1 



^ > H"^(^^{Z''\e"EM) - 2 log 1 
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By the term e^-secure we mean that A{S\Q"'EMF)p < e^. This impHes 
that the secret key is in particular independent of the seed F, which Alice 
needs to send to Bob and which could be wiretapped by the eavesdropper. 
Using (8.12), we can express this bound entirely in terms of correlations 
between Alice and Bob, i.e. 

i>n- Hiil{Z^\Z-) - H^-l^iZ-\Z-) - 21og 1 - 21og 1 - 7. 

This means that if the classical correlations between Alice's and Bob's 
measurement outcomes are sufficiently good — namely if the corresponding 
smooth max-entropies are small — we can safely extract a secret key using 
this protocol. 

It thus remains to find a statistical test to determine upper bounds 
on the smooth max-entropies. This is usually done in the following way. 
Alice and Bob, after measuring, compare a random subset of their raw 
keys using the classical channel. Then, if the frequency of errors found in 
this sample is smaller than an agreed threshold value, they will proceed 
with the protocol and extract a key. In this case, it can be shown that the 
smooth max-entropy is indeed small, and the key thus secure. Otherwise, 
they abort and do not produce a key. (A detailed analysis of the statistical 
tests and the precise security statements that follow is beyond the scope 
of this thesis. Such an analysis can be found in [TLGRll], for example.) 
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Chapter 9 

Conclusions and Outlook 



The goal of this thesis was to consohdate the smooth entropy framework for 
non-asymptotic information theory and to introduce important additions 
to the framework, including the entropic asymptotic equipartition property 
and various uncertainty relations. My sincere hope is that this work will 
provide a reference for researchers interested in non-asymptotic quantum 
information theory and quantum cryptography. 

9.1 Applications of the Framework 

The smooth entropy framework has already found a wide range of appli- 
cations since its conception. 

In cryptography, particularly quantum key distribution, the smooth 
entropy framework has become a standard tool to analyze security for 
finite keys [Ren05, SR08]. This analysis was simplified for some proto- 
cols thanks to the entropic uncertainty relation [TSSRll, TLGRll]. The 
smooth entropy formalism allows to investigate entropically secure encryp- 
tion [FS08, DDIO], and, together with results from randomness extraction, 
we use it to show that bit commitments cannot be expanded [WTHRll] in 
a quantum world. Furthermore, composable security in the bounded stor- 
age [WW08] and noisy storage [KWW09] models is analyzed using smooth 
entropies. 

Decoupling of quantum systems [DBWRIO] can be viewed as a fully 
quantum generalization of randomness extraction and is characterized by 
smooth entropies. The decoupling approach — a quantum generalization 
of random coding — leads to direct bounds for many information theoretic 
tasks. (Some of them are listed in [Dup09] and [DBWRIO].) We have also 
shown that decoupling is possible using approximate two-designs [SDTRll], 
which suggests that decoupling can be achieved efficiently in nature. 

The smooth entropy formalism has also been used to investigate vari- 
ous quantum channel capacities and converses [BDIO, DHBll, DHll]. In 
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particular, it leads to a conceptually simple proof of the quantum Reverse 
Shannon Theorem [BCRll]. 

In thermodynamics, smooth entropies have been used to quantify work 
extraction [DRRVll] and the work cost of erasure [dRAR"*"!!] in small 
systems. The smooth entropies have also been used to investigate ther- 
malization [HWll]. 

Finally, it has been shown [DR09] that the smooth entropy frame- 
work entails the information spectrum method by Han and Verdu [HV93, 
Han02] and its quantum generalization due to Hayashi, Nagaoka, and 
Ogawa [HN02, NH07, NOOO]. 

9.2 Outlook and Open Questions 

Some of the technical results in this thesis are new and applications re- 
main unexplored. The entropic asymptotic equipartition property pre- 
sented here has been improved from earlier work [TCR09] and now pro- 
vides a converse bound for finite smoothing. This may help in the quest to 
prove strong converse statements for various tasks in quantum information 
theory, including channel capacities. These arguments apply to classical 
theory as well and it remains to be seen how these converse bounds com- 
pare to the literature. 

In Chapter 7, Eq. (7.15), it was conjectured that the effective overlap 
is always smaller than the overlap. It remains an open question to show 
that this is true for general POVMs. 

The entropic uncertainty relation has been generalized from previous 
work [TRll]. The lower bound on the uncertainty is now expressed using 
an effective overlap. In turn, this effective overlap can be bounded in terms 
of the maximal CHSH value that can be reached — using the same mea- 
surement setup — with an arbitrary second party. The CHSH value is mea- 
sure of the non-locality of classical correlations produced by two parties. 
We thus believe that the generalized uncertainty relation provides a new 
avenue into device- independent quantum cryptography [HTll, LPT+11]. 
More generally, the applications of the uncertainty relation for smooth 
entropies in quantum cryptography are not fully explored yet. Another 
interesting extension is the bipartite uncertainty relation. It has potential 
applications in two-party quantum cryptography, which are unexplored. 

Finally, the recent addition of a complete set of chain rules for the 
smooth entropies provides an important missing link and will lead to an 
even larger range of applicability of the framework. 
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Appendix A 

Various Lemmas 

A.l Two Lemmas for Tensor Spaces 

The following Lemma gives an operator inequality that relates the marginal 
states on one Hilbert space before and after a CPM is applied on the other 
space. 

Lemma A.l. Let M G V{'K Ji') and let t he a CPM from Ji' to Ji" . 
Then, 

tr„"(£[M])<||£t[l]||^tr«.(M). 

Proof. We write £ in its Kraus representation, i.e. £[M] = "^j^ Ej^M E'^, 
where Ek G C{'K','K"). Due to the linearity and cyclicity of the partial 
trace, we have 

tr^-.(£[M]) =tr^>(j2ElEkM^ = tr«.(£t[l] M) 

k 

We introduce the operator R = l||£'l"[l]||oo — £^[i] > and note that 
tr^'iVRMy/R) > and, thus, 

tr«.(£t[l]M) <tr^.((£t[l] + R)M) = ||£t[l]||^ tr_„-(M). D 

In particular, this lemma implies that TP-CPMs do not affect the 
marginal state on another space. To see this, note that the bound evaluates 
to trj,"(£[M]) < tr„'(M). Hence, the operator trj,'(M) - tr5,"(£[M]) is 
positive and has vanishing trace, implying that the two marginal states are 
in fact equal. Moreover, if the map £ is trace non-increasing, we still find 
trM"(£[M]) < tr5^;'(M) as its adjoint is sub-unital. 

As a further example, consider the CPM £ : M i— )■ LML^ that con- 
jugates M with an arbitrary linear operator L € £(3f',^"). In this case, 
£' [1] evaluates to UL = |Lp and we find 

tr^"(£[M]) < |||L|2||^tr„-(M) = ||L||^ tr«-(M). (A.l) 
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To simplify the expression, we used that ||iV^||oo = II -^i^ ^oi N G ViH) 
and that |||L||| = ||L|| due to the unitary invariance of the norm applied to 
the polar decomposition of L. 

Moreover, the following operator inequality holds. 

Lemma A. 2. Let M G P(:K® CK'). Then, M < dim{5{'} tr„'(M) !„'. 

Proof. Since M has an eigenvalue decomposition with positive eigenvalues, 
it is sufficient to prove the property for an operator M of the form M = 
\(p){ip\, where cp G K<^ "K' . The general result then follows by linearity. 

Let X = tr5^;'(M) and A = X^^/^\(f){ip\X^^/'^ , where we take the gen- 
eralized inverse of X. Since tr(A) = rankjX} < dim{!K'} due to the 
Schmidt decomposition, we have A < dim{5{'} 1. The statement of the 
lemma than follows by conjugating this inequality with X^'^. D 

A. 2 Entropies of Coherent Classical States 

The following two technical lemmas are useful when coherent classical 
states are discussed. 

Lemma A. 3. Let r G S^CKxx'ab)- Then, the corresponding coherent 
classical state p = Hxx'T^xx' satisfies 

H„,,,{XA\X'B)r < H^^XXA\X'B), < H^,,{A\XX'B)p . 

Proof. By the definition of the min-entropy, there exists a state a G 
S={'Kx'b) such that 

TXX'AB < 2'^rnini^^^^'^'^^lxA (J X' B ■ 

Therefore, by conjugation with IIxx', we have 

Pxx-AB < 2'-f^min(^^l^'s).i^ ^ Y^ |a.^^3,|^ ^ \x){x\x' ® {x\ax'B\x)^, 

Since the measurement M^' is trace-preserving, Mx' [c] is a candidate for 
the optimization of H^:^^{X A\X' B) p and the first inequality of the lemma 
follows. The second inequality follows by a similar argument, where ax'B 
is chosen to maximize H^^:^^{X A\X' B) p and the state 

CTxx'B = ^ \x){x\x <^ \x){x\x' «> {x\ax'B\x)^, 

X 

is a candidate for the optimization of H^^^^J^A\XX'B). D 
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Lemma A. 4. Let p G S<{'Kxx'ab) be coherent classical between X and 
X'. Then, 

H^,,^{XA\X'B), < H^,^{A\XX'B)p. 

Proof. Since the coherent classical state p commutes with Hxx'^ there 
exists a state a G S={^x'b) such that 

9-H'max(''<^-4|X'B) _ II /— - fZk f5>, _ II 

■^ — \\y Pxx' AB\ ^xAyy <Jx' b\\i 

= \^/PxX'AB^/'^A®^XX'{'^X®Crx'B)'^XX'\i 

The last inequality follows from the fact that tr {llxx'{'^x ® (^x'b)^xx') = 
tr (Mx'[f]) = 1, which makes this state a candidate for the optimization 
mH^,^{A\XX'B). D 

A. 3 Selected Relations between Entropies 

The first lemma appeared in [TSSRll] and relates the min-entropy and 
the relative entropy where the conditioning is done on p. 

Lemma A. 5. Let e > and p G S<{'Kabc) be pure. Then, there exists a 
projector Hac on 'K^.c o-nd a state p = HpH such that p G 13p{p) and 

2 

SminiPABW^A <8) Pb) > H^^^{A\B)p - log ^ . 

Proof. The proof is structured as follows: First, we give a lower bound on 
the entropy SmmiPABW^A'S^ Pb) in terms oi H^^j^{A\B)p and a projector lis 
that is a dual projector of Hac in the sense described below. We then find 
a lower bound on the purified distance between p and p in terms of Hg 
and define 11^ (and, thus, Hac) such that this distance does not exceed e. 
Let A and a be the pair that optimizes the min-entropy H^^^{A\B) , 
i.e. H^^^{A\B)p = 5min(pAs||lA<X'o-s) = A. We have pg < pg by definition 
of p. Hence, SminCpAsUlA (d) pg) is finite and can be written as 

^ — \\Pb PabPb W^ ■ l^-^J 

We bound this expression using the dual projector of IIac, n^, i.e. the 
projector that satisfies (cf. Lemma 2.1) 

n^lr) = n^clr), where IP) = p~g'\) = Pa'Ap) = vec(n''s) 

is the unnormalized fully entangled state between supp {pg} and supp {pac}- 
We also use that pAg < 2^'^lA^o-g by definition of the min-entropy. Thus, 
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we bound the rhs. of (A. 2) as 



rhs. = ||trc((n^c i^ Pb ) Pabc {^ac <» Pb )) 



loo 



oo 



llrr -V2 -V2tt I 

= WnsPB pabPb n^i 

< 2-^\\tj, ® (Hs pt'^as pt'^IiB)\\^ 

— 9^'^iitt a tt ii 

— ^ W^^b-'-^b^^bWoo ) 

where, m the last step, we mtroduced the Hermitian operator A^ := 

— \J2 1/2 

Pg Ob Pb ■ Taking the logarithm on both sides leads to 

5min(pAs||lA^Ps) > i/mm(^|-B)p-log||nsAsns||oo. (A.3) 

We use Lemma 3.8 to bound the distance between Pabc and Pabc-, namely 
P{p, p) < V2tr(nicPAsc) = ^/2ii{IiipB) , (A.4) 

where the n^^-, and 11^ are the orthogonal complements of Hac and IIb, 
respectively. 

Clearly, the optimal choice of 11^ will cut off the largest eigenvalues of A 
in (A.3) while keeping p and p close. We thus define n^ to be the minimum 
rank projector onto the smallest eigenvalues of A such that tT{U.BPB) > 
tr(p)— e^/2 or, equivalently, iviJi^pB) < e^/2. This definition immediately 
implies that p ^e P and it remains to find an upper bound on insAsIlBloo. 

Let n^ be the projector onto the largest remaining eigenvalue in the 
operator Hb^b^b and note that 11'^ and IT^ commute with A^. Then, 

lin.A.n.iu = tr(n;A,) = ,,i, tr(^,(nj, + n-,)A,) ^ 

M tr(/ij 

where p is minimized over all positive operators in the support of Tig + 11^ . 
Fixing instead Pb = (Hg + I\'g)pB{^B + n'^), we find 

lin A n II < ^^i^fpB^'m + '^'B)) . tTJvfpBKf) ^ ^ 
||ll.A,ii,|U _ ^^((jj, ^ ^,^^^^^ _ ^^((nx + Yi',)pB) - e^ ■ 

In the last step we used that 

2 
ti{AfpBAf) = tr(cTs) = 1 and tr((n^ + U'g)pB) > y 

by definition of 11^ . This concludes the proof. D 

In addition, we need the following extension of this result to the smooth 
min-entropy. 
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Lemma A. 6. Let e > 0,e' > and p G 5<(!K^s). Then, there exists a 
state p G B'^^'^'' (p) such that 

Sram{pAB\\tA «> Pb) > H<^{A\B)p - log ^ . 

Proof. Let p G 5<(^asc) and p £ B^ (Pabc) be pure states such that 
H^^^^{A\B)p = H^^^{A\B)p. We apply Lemma A. 5 to this state to get 

2 

Smm{pAB\\tA (8) Pb) > H^i^{A\B)p - log ^ , where \p) = IlAc\p),P^e P 

Using Lemma 3.7, we define the operator Fg with the property FbPbFb = 
Pb] hence FbPabcFb ~e' Pabc- Applying this to the defining operator 
inequality of the relative entropy above leads to 

pAB < 2^-^Ia ®Pb^ Pab ■■= FbPabFI < 2~^1a O pg 

and, thus, SminipABW^A i^ Pb) < Smm{pAB\\^A <^ Pb)- Furthermore, p G 
S<{JiABc) since tr(^) = ^{FbPbFI) < tT{FBPBFl) = tr(pB) < 1. Hence, 
it remains to bound P{p, p) < P{p, p) + Pip, p) + Pip, p) < -P(p, p) +e + e' 
and 

P{p, p) = P{{Fb (S) Uac) Pabc{Fb ® n^c), n^c Pabc^ac) 
< P{FbPabcFI,Pabc) < e', 

where we used the monotonicity of the purified distance (cf. Theorem 3.4) 
under projections. This concludes the proof. D 
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Appendix B 

Properties of 
Quasi-Entropies 



This appendix discusses properties of the quasi-entropies and relative Renyi 
Entropies introduced in Chapter 6. 

B.l Properties of Quasi-Entropies 

The quasi entropies (cf. Def. 6.1) are weh-defined in the sense that they 
are covariant under isometrics on A and B. 

Lemma B.l. Let U : "K ^ "K' he an isometry. Then, for all f -quasi 
entropies and for all A,B£ 'P(IK), we have 

Sf{A I B) = SfiUAU^ II UBU^) . 

Proof Let A = X^j Aj|ei)(ej| and B = Ylj l^j\fj)Uj\ with eigenvalues Aj > 
0, //j > 0. Now, we write (see also [NOOO]) 

5;(^||B) = limj;(/.,+0/(^)|(e.|/,)^ 

The isometry U keeps eigenvalues and the scalar product (ej|/j) invariant. 
Furthermore, any zero eigenvalues introduced do not contribute since they 
lie in a space orthogonal to the image of U , where the summands vanish 
since lim^^o C/(0) = 0.^ D 

Furthermore, it turns out that quasi-entropies have nice properties if 
the function / is chosen operator concave. In particular, the following 



^In Def. 6.1, we require that /(O) £ R is finite. 
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property holds. ^ 

Lemma B.2. Let E be a TP-CPM. Then, for all f quasi- entropies with f 
operator concave and for all A,B £ 'P{'K), we have 

Sf{A\\B)<Sf{E.{A)\\E.{B)). 

Proof. Any TP-CPM can be expressed as an isometry followed by a partial 
trace (cf. Lemma 2.4), hence, in conjunction with Lemma B.l, it remains 
to show the property for the partial trace operation. We will show this 
mider the assumption that B is invertible and the result for general B will 
follow from the continuity (by definition) of Sf when ^ — )• 0. 

To show monotonicity under partial trace, we let 'K = IKi (8) 'K2 with 
bases {\i)i} and {1^)2}, respectively. We use the (unnormalized) fully 
entangled state I7) = ^^ • (|i)i \j)2) '^ (|^)i €) \j)2) in the product basis 
and its marginal |7)i = Xli K)i <SD |i)i. It remains to show that Sf{A\\B) < 
Sf{Ai\\Bi), where Ai = tr^^iA) and Bi = tViiB). 

Let us define a linear map V : "Ki (E> "K'^ -^ 'K^'K' hy 



y := ^ (/B (^i ' 1^)2)) O li \i)2 ■ 



The map V is an isometry, i.e. V'V = In and satisfies 

F(/b^®1i|7)i) = Vi?®li2|7)- (B.l) 

We have tr2(A^) = Af, since the transpose is taken in the product basis. 
Hence, V'^{B~'^ (g) A'^)V = B^^ (g) AJ. Next, we apply the Operator Jensen 
Inequality (Lemma 2.8) to get 

V^f{B-' A'')V < f{v\B-' ® A')V) = f{B{' .4^) . 

Finally, using (B.l), we recover Sf{A\\B) < Sf{Ai\\Bi) by taking the 
matrix element for (\/^'8' li)|7)i on both sides of the inequality. D 

B.2 Properties of the Renyi Entropy 

Here, we prove some general properties of the relative Renyi entropies, 
introduced in Chapter 6 as 

S^{A\\B) = j^logtr {A^B'-''). 



^This was essentially already shown in [Pct84] for the partial trace (see also [Hay06]) 
up to the continuity arguments above that allow us to define the quasi-entropies for non- 
invertible B. This extension (cf. Lemma B.l) is crucial, since even if B is invertible, 
£(-B) is generally not if £ is an isometry. Hence, our contribution is to extend the 
monotonicity argument to general TP-CPMs. 
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Note that a similar quantity appears in quantum hypothesis testing [NOOO, 
AMAV07] and in [Hay06, OP93, MD09], where alternative proofs of some 
of the following properties can be found. 

Unlike their classical counterparts, the quantum relative (and condi- 
tional) min- and max-entropies cannot be recovered as special cases of 
a-entropies. However, it can be shown [KRS09] that 



Sy,{A\\B) = 2logtr{VAVB) < 2logtT\^/A^/B\ = Sn,^^{A\\B) . (B.2) 

Furthermore, using the eigenvalue decompositions A = J2i ^i\^i){^i\ ^^'^ 
B = Y.J fJ'j\fj){fj\, we have 

SooiA\\B) = \im-\og max -^"^ < Sram{A\\B) . 

The entropies are additive, e.g. evaluation for an i.i.d. operator A®"' 
relative to another i.i.d. operator B^"" results in 

5«(^®"||5®'^) = nSa{A\\B) . (B.3) 

The relative Renyi entropies decrease monotonically in a. 

Lemma B.3. Let a > /3 > and let p G S={'X), a G V{'K) OTi/isupp{p} C 
suppjo"}. Then, Sa{p\\(y) < Sf3{p\\a). 

Proof. We first show that Sa decreases monotonically with increasing a 
by showing that its derivative is negative for all a G [0, 1) U (1, oo). 

Using the (unnormalized) fully entangled state I7), we define a pu- 
rification \if) := (y^ (g) 1)17) of A. Furthermore, we set a = a — 1 and 
X = (p® a~^)'^. It is easy to verify that, for / : 1 1— )• tlogi, 

Sa{A\\B) = -^ log {^\X^\V) and 
a 

f{{ip\x^\ip))-W{x^M 



a^^\X^\^) 



<0. 



The monotonicity follows from the convexity of / together with Jensen's 
inequality (cf. Lemma 2.7). D 

The Renyi entropies cannot decrease under simultaneous application 
of a CPM on both arguments. This is equivalent to a data-processing 
inequality for the conditional version of the entropies.^ 



^Conversely, it is easy to see that the data-processing inequality for the min- and 
max-entropy (cf. Theorem 5.7) imply monotonicity for Smax and S'min- 
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Lemma B.4 (Monotonicity of Renyi Entropy). Let a G [0,2], let A,B£ 

V{:K) and let t he a TP-CPM. Then, Sa{A\\B) < Sa{e{A) ||£(S)). 

Proof. This follows directly from Lemma B.2 and the fact that g^ '■ t ^^ t" 
is operator concave for a G (0,1) and operator convex for a E (1,2]. 
(Also note that the pre-factor conveniently changes sign between these 
two domains.) In the limits a — )• and a — )• 1, the property follows by 
continuity.^ D 

We define conditional versions of these entropies as follows. For a state 
p G 5<(!K^a), the conditional Renyi a-entropy of A given B is 

H^{A\B)p := Sa{pAB\\tCd Pb) ■ 
This definition allows a duality relation for pure tri-partite states. 
Lemma B.5. Let p G S<{'}iABc) be pure and a £ [0,1) U (1,2]. Then 

H^{A\B)p + H,_^{A\C), = 0. 

Proof. We write p = \'&){i!)\ and note that the marginal states Pab and pc 
satisfy PabW) = Pc\'&) and ps|??) = PacW)- Thus, 

(1 - a)H^{A\B), = logtr (pSsPb"") = log (^Ip^bVb""!^) 

= log(^|prVic"l^> = (« - l)H,_^{A\B)p. 

The last equality follows from a — 1 = 1 — (2 — a). D 



^For the von Neumann entropy this property (and strong sub-additivity) also follows 
from the operator concavity of /i : i i— >■ — tlogf, as noted in [NP05]. 
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e-ball, 45, 56, 71 
e-close, 53 
e-smooth, 45, 56 

adjoint, 24 
AEP, 17, 87, 127 
algebra, see linear algebra 
angular distance, 49, 50 
asymptotic equipartition, see AEP 

basis, 23 

orthonormal, 23 
Bures metric, 49 

chain rule, 17, 85 
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coherent classical, 38, 40, 81, 154 

concave, 42, 70, 159 

conjugation, 31 

contraction, 95 

convergence parameter, 97 

converse bound, 16, 101, 127, 130, 

131, 133 
convex, 42, 70 
covariant, 159 
CPM, 5, 31, 153 
CQ, see state 

data processing, 11, 17, 70-72, 79, 

127 
decomposition, 23, 24 

eigenvalue, 29 

polar, 27, 53, 154 

Schmidt, 28, 63 

singular value, 27 
dimension, 23, 34, 35 
direct bound, 15, 100, 130, 133 



direct sum, 23 

distinguishing advantage, 47, 49, 132 

duality, 17, 63, 72, 77, 162 

effective overlap, see overlap 

eigenbasis, 29 

eigenvalue, 29, 63 

eigenvector, 29 

embedding, 26 

endomorphism, 25 

entangled, 37 

entanglement, 8, 9, 13, 14 

entropy, 10, 45 
Hartley, 15, 16 
max-entropy, 15, 56, 60, 65 
min-entropy, 56, 58, 65, 155 
Renyi, 15, 55, 91, 95, 160 
Shannon, 10, 13, 93 
smooth, 15-17, 71, 73, 94, 103 
von Neumann, 12, 13, 55, 62, 
88, 93, 103, 129 

event, 10, 12 

evolution, 39 

extension, 38 

fidelity, 47 

generalized, 48 

generalized fidelity, see fidelity 
generalized inverse, 93, 94 
guessing probability, 64, 84 

Hamiltonian, 39 
Hermitian, see operator 
hidden variable, 9 
Hilbert space, 21, 27, 29, 35 
homomorphism, 24 
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i.i.d., 5, 11, 13, 87, 129 

i.i.d. limit, 12, 14, 17, 90, 127, 134 

identity, 25 

image, 25 

indicator, 23 

information theory, 9 

inner product, 21 

Hilbert-Schmidt, 27 
inverse, 25 

generalized, 25, 53, 154 
isometry, 26 

partial, 26, 51 
isomorphism, 25, 27 

Choi-Jamiolkowski, 32, 59 

kernel, 25 

Ihs., 5 

linear algebra, 21 

marginal, 10, 37, 112, 153 
max-entropy, see entropy 
measurement, 12, 39 

incompatible, 13, 103 

projective, 40, 79 
measurement setup, 112 
metric, 22, 47, 50 
min-entropy, see entropy 
modulus, 27 
monotone, 42 

no-cloning, 9 
norm, 22 

Schatten, 28 

operator, 28 
normal, 29 
notation, 5, 6, 53 

observable, 103 

observer, 8, 11 

ignorant, 8, 13, 18 
omniscient, 8, 9, 18, 60 

one-shot, 14, 16, 17, 55-57, 88, 127, 
134 

operator, 24 



Hermitian, 29 

identity, 25 

Kraus, 33 

linear, 24 

positive, 30 

positive semi-definite, 30 

unitary, 26 
orthonormal, see basis 
overlap, 104, 108, 109 

effective, 108, 112 

partial trace, 32 
partial transpose, 37 
POVM, 5, 40, 104 

element, 40 
probability, 8, 10, 36, 40, 132 
projector, 25, 48, 51, 53 
purification, 38, 48, 61 
purified distance, 16, 46, 49, 72, 
127 

QKD, 5, 127, 134 
quantum 

cryptography, 9, 134 

mechanics, 7, 34 
quantum channel, 39 
quantum state, 12, 13, 35 
qubit, 12, 35 

random variable, 10, 36, 40 
randomness extraction, 17, 127, 132 
rank, 25 
register, 12, 36, 39, 63, 72, 80, 104, 

132 
representation 

Kraus, 33, 153 
resource, 12, 14, 15 
rhs., 5 

SDP, 5, 41, 58, 59 

self-adjoint, 29 

semi-definite program, see SDP, 56 

separable, 37 

side information, 10, 11, 127, 132 

smoothing, 15, 56, 73, 74 
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smoothing parameter, 45 
source compression, 14, 127 
span, 22 
state 

CQ, 5, 38, 64, 81, 104 

joint, 36, 38 

mixed, 36 

pure, 36, 38, 162 
state merging, 14, 17 
statistical distance, 16 
Stinespring dilation, 33, 40, 80, 85 
sub-multiplicative, 29 
sub-unital, 32 
super-operator, 31 
support, 25 

surprisal, 10, 11, 13, 16 
symmetric, 48 
system, 12, 35 

joint, 36 

tensor product, 23, 30 

TP-CPM, 5, 31, 39, 51, 59, 72, 79, 

81, 153 
trace, 26, 32 

non-increasing, 32 

preserving, 31 
trace distance, 46, 47 
trace non-increasing, 51 
transpose, 25 
typical set, 87, 89 

UCR, see uncertainty 
Uhlmann's theorem, 48, 52, 61 
uncertainty, 10 

principle, 8 

relation, 5, 103 
uncertainty relation, 13 
unital, 32, 59, 79 
unitarily invariant, 29, 76, 154 
unitary, 39 

vector space, 21 

von Neumann entropy, see entropy 
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